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Preface to the Chemistry of Electronic Materials 


The intention of this text is not to provide a comprehensive reference to all 
aspects of semiconductor device fabrication or a review of research results 
that, irrespective of their promise, have not been adopted into mainstream 
production. Instead it is aimed to provide a useful reference for those 
interested in the chemical aspects of the electronics industry. 


Given the nature of Connexions, this course is fluid in structure and 
content. In addition, it contains modules by other authors where 
appropriate. The content will be updated and expanded with time. If any 
authors have suitable content, please contact me and I will be glad to assist 
in transforming the content to a suitable module structure. 


Andrew R. Barron 


Rice University, Houston, TX 77005. E-mail: arb@rice.edu 


Introduction to Semiconductors 
Introduction to semiconductors, mainly looking at the behavior of electrons 
in a solid from a quantum mechanical point of view. 


Note:This module is adapted from the Connexions module entitled 
Introduction to Semiconductors by Bill Wilson. 


If we only had to worry about simple conductors, life would not be very 
complicated, but on the other hand we wouldn't be able to make computers, 
CD players, cell phones, i-Pods and a lot of other things which we have 
found to be useful. We will now move on, and talk about another class of 
conductors called semiconductors. 


In order to understand semiconductors and in fact to get a more accurate 
picture of how metals, or normal conductors actually work, we really have 
to resort to quantum mechanics. Electrons in a solid are very tiny objects, 
and it turns out that when things get small enough, they no longer exactly 
following the classical "Newtonian" laws of physics that we are all familiar 
with from everyday experience. It is not the purpose of this course to teach 
quantum mechanics, so what we are going to do instead is describe the 
results which come from looking at the behavior of electrons in a solid from 
a quantum mechanical point of view. 


Solids (at least the ones we will be talking about, and especially 
semiconductors) are crystalline materials, which means that they have their 
atoms arranged in a ordered fashion. We can take silicon (the most 
important semiconductor) as an example. Silicon is a group 14(IV) element, 
which means it has four electrons in its outer or valence shell. Silicon 
crystallizes in a structure called the diamond crystal lattice, shown in [link]. 
Each silicon atom has four covalent bonds, arranged in a tetrahedral 
formation about the atom center. 


Crystal structure of 
silicon. 


In two dimensions, we can schematically represent a piece of single-crystal 
silicon as shown in [link]. Each silicon atom shares its four valence 
electrons with valence electrons from four nearest neighbors, filling the 
shell to 8 electrons, and forming a stable, periodic structure. Once the atoms 
have been arranged like this, the outer valence electrons are no longer 
strongly bound to the host atom. The outer shells of all of the atoms blend 
together and form what is called a band. The electrons are now free to move 
about within this band, and this can lead to electrical conductivity as we 
discussed earlier. 
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A 2-D representation of a 
silicon crystal. 


This is not the complete story however, for it turns out that due to quantum 
mechanical effects, there is not just one band which holds electrons, but 
several of them. What will follow is a very qualitative picture of how the 
electrons are distributed when they are in a periodic solid, and there are 
necessarily some details which we will be forced to gloss over. On the other 
hand this will give you a pretty good picture of what is going on, and may 
enable you to have some understanding of how a semiconductor really 
works. Electrons are not only distributed throughout the solid crystal 
spatially, but they also have a distribution in energy as well. The potential 
energy function within the solid is periodic in nature. This potential 
function comes from the positively charged atomic nuclei which are 
arranged in the crystal in a regular array. A detailed analysis of how 
electron wave functions, the mathematical abstraction which one must use 
to describe how small quantum mechanical objects behave when they are in 
a periodic potential, gives rise to an energy distribution somewhat like that 
shown in [link]. 
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Band Gap 


Schematic of the first two 
bands in a periodic solid 
showing energy levels 
and bands. 


Firstly, unlike the case for free electrons, in a periodic solid, electrons are 
not free to take on any energy value they wish. They are forced into specific 
energy levels called allowed states, which are represented by the cups in 
[link]. The allowed states are not distributed uniformly in energy either. 
They are grouped into specific configurations called energy bands. There 
are no allowed levels at zero energy and for some distance above that. 
Moving up from zero energy, we then encounter the first energy band. At 
the bottom of the band there are very few allowed states, but as we move up 
in energy, the number of allowed states first increases, and then falls off 
again. We then come to a region with no allowed states, called an energy 
band gap. Above the band gap, another band of allowed states exists. This 
goes on and on, with any given material having many such bands and band 
gaps. This situation is shown schematically in [link], where the small cups 
represent allowed energy levels, and the vertical axis represents electron 
energy. 


It turns out that each band has exactly 2N allowed states in it, where N is the 
total number of atoms in the particular crystal sample we are talking about. 
(Since there are 10 cups in each band in the figure, it must represent a 
crystal with just 5 atoms in it. Not a very big crystal at all!) Into these bands 
we must now distribute all of the valence electrons associated with the 
atoms, with the restriction that we can only put one electron into each 
allowed state. This is the result of something called the Pauli exclusion 
principle. Since in the case of silicon there are 4 valence electrons per atom, 
we would just fill up the first two bands, and the next would be empty. If 
we make the logical assumption that the electrons will fill in the levels with 
the lowest energy first, and only go into higher lying levels if the ones 
below are already filled. This situation is shown in [link], in which we have 
represented electrons as small black balls with a "-" sign on them. Indeed, 
the first two bands are completely full, and the next is empty. What will 
happen if we apply an electric field to the sample of silicon? Remember the 
diagram we have at hand right now is an energy based one, we are showing 
how the electrons are distributed in energy, not how they are arranged 
spatially. On this diagram we can not show how they will move about, but 
only how they will change their energy as a result of the applied field. The 


electric field will exert a force on the electrons and attempt to accelerate 
them. If the electrons are accelerated, then they must increase their kinetic 
energy. Unfortunately, there are no empty allowed states in either of the 
filled bands. An electron would have to jump all the way up into the next 
(empty) band in order to take on more energy. In silicon, the gap between 
the top of the highest most occupied band and the lowest unoccupied band 
is 1.1 eV. (One eV is the potential energy gained by an electron moving 
across an electrical potential of one volt.) The mean free path or distance 
over which an electron would normally move before it suffers a collision is 
only a few hundred angstroms (ca. 300 x 10°8 cm) and so you would need a 
very large electric field (several hundred thousand V/cm) in order for the 
electron to pick up enough energy to "jump the gap". This makes it appear 
that silicon would be a very bad conductor of electricity, and in fact, very 
pure silicon is very poor electrical conductor. 
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A metal is an element with an odd number of valence electrons so that a 
metal ends up with an upper band which is just half full of electrons. This is 
illustrated in [link]. Here we see that one band is full, and the next is just 
half full. This would be the situation for the Group 13(III) element 
aluminum for instance. If we apply an electric field to these carriers, those 
near the top of the distribution can indeed move into higher energy levels 
by acquiring some kinetic energy of motion, and easily move from one 
place to the next. In reality, the whole situation is a bit more complex than 
we have shown here, but this is not too far from how it actually works. 
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So, back to our silicon sample. If there are no places for electrons to "move' 
into, then how does silicon work as a "Semiconductor"? Well, in the first 
place, it turns out that not all of the electrons are in the bottom two bands. 
In silicon, unlike say quartz or diamond, the band gap between the top-most 
full band, the next empty one is not so large. As we mentioned above it is 
only about 1.1 eV. So long as the silicon is not at absolute zero temperature, 
some electrons near the top of the full band can acquire enough thermal 
energy that they can "hop" the gap, and end up in the upper band, called the 
conduction band. This situation is shown in [link]. 
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Thermal 
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across the 
band gap. 


In silicon at room temperature, roughly 10!” electrons per cubic centimeter 
are thermally excited across the band-gap at any one time. It should be 
noted that the excitation process is a continuous one. Electrons are being 
excited across the band, but then they fall back down into empty spots in 
the lower band. On average however, the 10!° in each cm? of silicon is what 
you will find at any given instant. Now 10 billion electrons per cubic 
centimeter seems like a lot of electrons, but lets do a simple calculation. 
The mobility of electrons in silicon is about 1000 cm?/V.s. Remember, 
mobility times electric field yields the average velocity of the carriers. 
Electric field has units of V/cm, so with these units we get velocity in cm/s 
as we should. The charge on an electron is 1.6 x 10°! coulombs. Thus from 
[link]: 

Equation: 


o = nqu 
= 10!° (1.6 x 10°!) 1000 
= 1.6x 10° mhos/em 


If we have a sample of silicon 1 cm long by (1 mm x 1mm) square, it would 
have a resistance, [link], which does not make it much of a "conductor". In 
fact, if this were all there was to the silicon story, we could pack up and 
move on, because at any reasonable temperature, silicon would conduct 
electricity very poorly. 

Equation: 


R =L/oA 
1/(1.6 x 10°°)(0.1)? 
1.6 x 10° MQ 


Doped Semiconductors 
From the silicon's crystal structure to discuss how to make doped 
semiconductors and the mechanics. 


Note:This module is adapted from the Connexions module entitled Doped 
Semiconductors by Bill Wilson. 


To see how we can make silicon a useful electronic material, we will have 
to go back to its crystal structure ([link]). Suppose somehow we could 
substitute a few atoms of phosphorus for some of the silicon atoms. 
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A two dimensional 
representation of a 
silicon crystal lattice 
"doped" with 
phosphorus. 


If you sneak a look at the periodic table, you will see that phosphorus is a 
group V element, as compared with silicon which is a group 14(IV) 
element. What this means is the phosphorus atom has five outer or valence 
electrons, instead of the four which silicon has. In a lattice composed 
mainly of silicon, the extra electron associated with the phosphorus atom 
has no "mating" electron with which it can complete a shell, and so is left 
loosely dangling to the phosphorus atom, with relatively low binding 
energy. In fact, with the addition of just a little thermal energy (from the 


natural or latent heat of the crystal lattice) this electron can break free and 
be left to wander around the silicon atom freely. In our "energy band" 
picture, we have something like what we see in [link]. The phosphorus 
atoms are represented by the added cups with P's on them. They are new 
allowed energy levels which are formed within the "band gap" near the 
bottom of the first empty band. They are located close enough to the empty 
(or "conduction") band, so that the electrons which they contain are easily 
excited up into the conduction band. There, they are free to move about and 
contribute to the electrical conductivity of the sample. Note also, however, 
that since the electron has left the vicinity of the phosphorus atom, there is 
now one more proton than there are electrons at the atom, and hence it has a 
net positive charge of 1q. We have represented this by putting a little "+" 
sign in each P-cup. Note that this positive charge is fixed at the site of the 
phosphorous atom called a donor since it "donates" an electron up into the 
conduction band, and is not free to move about in the crystal. 
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Silicon doped with 
phosphorus. 


How many phosphorus atoms do we need to significantly change the 
resistance of our silicon? Suppose we wanted our 1 mm by 1 mm square 
sample to have a resistance of one ohm as opposed to more than 60 MQ. 


Turning the resistance equation around we get, [link]. And hence, if we 
continue to assume an electron mobility of 1000 cm?/volt.sec, [link]. 
Equation: 


o = L/RA 

= 1Q/1 x (0.1) 

= 100 mho/cm 
Equation: 
n = O/qu 


100/(1.6 x 10°!°)1000 
= 6.25 x 10!7 cm? 


Now adding more than 6 x 10!” phosphorus atoms per cubic centimeter 
might seem like a lot of phosphorus, until you realize that there are almost 
10*4 silicon atoms in a cubic centimeter and hence only one in every 1.6 
million silicon atoms has to be changed into a phosphorus one to reduce the 
resistance of the sample from several 10s of MQ down to only one Q. This 
is the real power of semiconductors. You can make dramatic changes in 
their electrical properties by the addition of only minute amounts of 
impurities. This process is called doping the semiconductor. It is also one of 
the great challenges of the semiconductor manufacturing industry, for it is 
necessary to maintain fantastic levels of control of the impurities in the 
material in order to predict and control their electrical properties. 


Again, if this were the end of the story, we still would not have any 
calculators, cell phones, or stereos, or at least they would be very big and 
cumbersome and unreliable, because they would have to work using 
vacuum tubes. We now have to focus on the few "empty" spots in the lower, 
almost full band (called the valence band.) We will take another view of 
this band, from a somewhat different perspective. I must confess at this 
point that what I am giving you is even further from the way things really 
work, then the "cups at different energies" picture we have been using so 
far. The problem is, that in order to do things right, we have to get involved 
in momentum phase-space, a lot more quantum mechanics, and generally a 
bunch of math and concepts we don't really need in order to have some idea 


of how semiconductor devices work. What follow below is really intended 
as a motivation, so that you will have some feeling that what we state as 
results, is actually reasonable. 


Consider [link]. Here we show all of the electrons in the valence, or almost 
full band, and for simplicity show one missing electron. Let's apply an 
electric field, as shown by the arrow in the figure. The field will try to move 
the (negatively charged) electrons to the left, but since the band is almost 
completely full, the only one that can move is the one right next to the 
ony spot, or hole as it is called. 
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Band full of electrons, 
with one missing. 


One thing you may be worrying about is what happens to the electrons at 
the ends of the sample. This is one of the places where we are getting a 
somewhat distorted view of things, because we should really be looking in 
momentum, or wave-vector space rather than "real" space. In that picture, 
they magically drop off one side and "reappear" on the other. This doesn't 
happen in real space of course, so there is no easy way we can deal with it. 


A short time after we apply the electric field we have the situation shown in 
[link], and a little while after that we have [link]. We can interpret this 
motion in two ways. One is that we have a net flow of negative charge to 
the left, or if we consider the effect of the aggregate of all the electrons in 
the band we could picture what is going on as a single positive charge, 
moving to the right. This is shown in [link]. Note that in either view we 
have the same net effect in the way the total net charge is transported 


through the sample. In the mostly negative charge picture, we have a net 
flow of negative charge to the left. In the single positive charge picture, we 
have a net flow of positive charge to the right. Both give the same sign for 
the current! 


E 


Motion of the 
"missing" electron 
with an electric field. 


Further motion of the 
"missing electron" 
spot. 


Motion of a "hole" 
due to an applied 
electric field. 


Thus, it turns out, we can consider the consequences of the empty spaces 
moving through the co-ordinated motion of electrons in an almost full band 
as being the motion of positive charges, moving wherever these empty 
spaces happen to be. We call these charge carriers "holes" and they too can 
add to the total conduction of electricity in a semiconductor. Using p to 
represent the density (in cm” of spaces in the valence band and py, and pip, to 
represent the mobility of electrons and holes respectively (they are usually 
not the same) we can modify to give the conductivity 0, when both 
electrons’ holes are present, [link]. 

Equation: 


Oo = nqu, + Pq, 


How can we get a sample of semiconductor with a lot of holes in it? What 
if, instead of phosphorus, we dope our silicon sample with a group III 
element, say boron? This is shown in [link]. Now we have some missing 
orbitals, or places where electrons could go if they were around. This 
modifies our energy picture as follows in [link]. Now we see a set of new 
levels introduced by the boron atoms. They are located within the band gap, 
just a little way above the top of the almost full, or valence band. Electrons 
in the valence band can be thermally excited up into these new allowed 
levels, creating empty states, or holes, in the valence band. The excited 


electrons are stuck at the boron atom sites called acceptors, since they 
"accept" an electron from the valence band, and hence act as fixed negative 
charges, localized there. A semiconductor which is doped predominantly 
with acceptors is called p-type, and most of the electrical conduction takes 
place through the motion of holes. A semiconductor which is doped with 
donors is called n-type, and conduction takes place mainly through the 
motion of electrons. 
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A two dimensional 
representation of a 
silicon crystal lattice 
doped with boron. 
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P-type silicon, due to 
boron acceptors. 


In n-type material, we can assume that all of the phosphorous atoms, or 
donors, are fully ionized when they are present in the silicon structure. 
Since the number of donors is usually much greater than the native, or 
intrinsic electron concentration, (* 10'° cm”), if Ny is the density of donors 
in the material, then n, the electron concentration, ~ Ng. If an electron 
deficient material such as boron is present, then the material is called p-type 
silicon, and the hole concentration is just * N, the concentration of 
acceptors, since these atoms "accept" electrons from the valence band. 


If both donors and acceptors are in the material, then which ever one has the 
higher concentration wins out. This is called compensation. If there are 
more donors than acceptors then the material is n-type and n * Nj, - Ng. If 
there are more acceptors than donors then the material is p-type and p * N, 
- Nq. It should be noted that in most compensated material, one type of 
impurity usually has a much greater (several order of magnitude) 
concentration than the other, and so the subtraction process described above 
usually does not change things very much, e.g., 10/8 - 1016 = 1018. 


One other fact which you might find useful is that, again, because of 
quantum mechanics, it turns out that the product of the electron and hole 
concentration in a material must remain a constant. In silicon at room 
temperature: 

Equation: 


Thus, if we have an n-type sample of silicon doped with 10!” donors per 
cubic centimeter, then n, the electron concentration is just p , the hole 
concentration, is 102°/10!” = 10° cm’. The carriers which dominate a 
material are called majority carriers, which would be the electrons in the 
above example. The other carriers are called minority carriers (the holes in 
the example) and while 10° might not seem like much compared to 10!” the 
presence of minority carriers is still quite important and can not be ignored. 
Note that if the material is undoped, then it must be that n = p and n = p= 
10" 


The picture of "cups" of different allowed energy levels is useful for 
gaining a pictorial understanding of what is going on in a semiconductor, 
but becomes somewhat awkward when you want to start looking at devices 
which are made up of both n and p type silicon. Thus, we will introduce one 
more way of describing what is going on in our material. The picture shown 
in [link] is called a band diagram. A band diagram is just a representation 
of the energy as a function of position with a semiconductor device. In a 
band diagram, positive energy for electrons is upward, while for holes, 
positive energy is downwards. That is, if an electron moves upward, its 
potential energy increases just as a with a normal mass in a gravitational 
field. Also, just as a mass will "fall down" if given a chance, an electron 
will move down a slope shown in a band diagram. On the other hand, holes 
gain energy by moving downward and so they have a tendancy to "float" 
upward if given the chance - much like a bubble in a liquid. The line 
labeled E,, in [link] shows the edge of the conduction band, or the bottom of 
the lowest unoccupied allowed band, while E,, is the top edge of the 
valence, or highest occupied band. The band gap, E, for the material is 
obviously E, - Ey. The dotted line labeled E; is called the Fermi level and it 
tells us something about the chemical equilibrium energy of the material, 
and also something about the type and number of carriers in the material. 
More on this later. Note that there is no zero energy level on a diagram such 
as this. We often use either the Fermi level or one or other of the band edges 
as a reference level on lieu of knowing exactly where "zero energy" is 


located. 
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An electron band- 
diagram for a 
semiconductor. 


The distance (in energy) between the Fermi level and either E,, and E,, gives 
us information concerning the density of electrons and holes in that region 
of the semiconductor material. The details, once again, will have to be 
begged off on grounds of mathematical complexity. It turns out that you can 
Say: 

Equation: 


Equation: 


kT 


p=Ne 


Both N, and N,, are constants that depend on the material you are talking 
about, but are typically on the order of 10!9 cm’. The expression in the 
denominator of the exponential is just Boltzman's constant (8.63 x 10° 
eV/K), k, times the temperature T of the material (in absolute temperature 
or Kelvin). At room temperature kT = '/49 of an electron volt. Look 
carefully at the numerators in the exponential. Note first that there is a 
minus sign in front, which means the bigger the number in the exponent, the 
fewer carriers we have. Thus, the top expression says that if we have n-type 
material, then E must not be too far away from the conduction band, while 
if we have p-type material, then the Fermi level,E, must be down close to 
the valence band. The closer EF gets to E, the more electrons we have. The 
closer Ey gets to Ey, the more holes we have. [link] therefore must be for a 
sample of n-type material. Note also that if we know how heavily a sample 
is doped (i.e., we know what Nj is) and from the fact that n * Ng we can use 
[link] to find out how far away the Fermi level is from the conduction band, 
[link]. 

Equation: 


N. 


To help further in our ability to picture what is going on, we will often add 
to this band diagram, some small signed circles to indicate the presence of 
mobile electrons and holes in the material. Note that the electrons are 
spread out in energy. From our "cups" picture we know they like to stay in 
the lower energy states if possible, but some will be distributed into the 
higher levels as well. What is distorted here is the scale. The band-gap for 
silicon is 1.1 eV, while the actual spread of the electrons would probably 
only be a few tenths of an eV, not nearly as much as is shown in [link]. Lets 
look at a sample of p-type material, just for comparison. Note that for holes, 
increasing energy goes down not up, so their distribution is inverted from 
that of the electrons. You can kind of think of holes as bubbles in a glass of 
soda or beer, they want to float to the top if they can. Note also for both n 
and p-type material there are also a few "minority" carriers, or carriers of 
the opposite type, which arise from thermal generation across the band-gap. 
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n-type 
semiconductor. 


Diffusion 
The module discusses the process of electrons moving across a p-n or n-p 
junction known as diffusion. 


Note:This module is adapted from the Connexions module entitled 
Diffusion by Bill Wilson. 


Introduction 


Let us turn our attention to what happens to the electrons and holes once 
they have been injected across a forward-biased junction. We will 
concentrate just on the electrons which are injected into the p-side of the 
junction, but keep in mind that similar things are also happening to the 
holes which enter the n-side. 


When electrons are injected across a junction, they move away from the 
junction region by a diffusion process, while at the same time, some of 
them are disappearing because they are minority carriers (electrons in 
basically p-type material) and so there are lots of holes around for them to 
recombine with. This is all shown schematically in [link]. 


Injection 


Processes involved in 
electron transport 
across a p-n junction. 


Diffusion process quantified 


It is actually fairly easy to quantify this, and come up with an expression for 
the electron distribution within the p-region. First we have to look a little bit 
at the diffusion process however. Imagine that we have a series of bins, 
each with a different number of electrons in them. In a given time, we could 
imagine that all of the electrons would flow out of their bins into the 
neighboring ones. Since there is no reason to expect the electrons to favor 
one side over the other, we will assume that exactly half leave by each side. 
This is all shown in [link]. We will keep things simple and only look at 
three bins. Imagine there are 4, 6, and 8 electrons respectively in each of the 
bins. After the required "emptying time," we will have a net flux of exactly 
one electron across each boundary as shown. 


A schematic 
representation of a 
diffusion problem. 


Now let's raise the number of electrons to 8, 12 and 16 respectively ({link]). 
We find that the net flux across each boundary is now 2 electrons per 
emptying time, rather than one. Note that the gradient (slope) of the 
concentration in the boxes has also doubled from one per box to two per 
box. This leads us to a rather obvious statement that the flux of carriers is 
proportional to the gradient of their density. This is stated formally in what 


is known as Fick's First Law of Diffusion, [link]. Where D, is simply a 
proportionality constant called the diffusion coefficient. Since we are 
talking about the motion of electrons, this diffusion flux must give rise to a 
current density J.,.... Since an electron has a charge —q associated with it, 
[link]. 


Equation: 
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A schematic 
representation of a 
diffusion from bins. 


Now we have to invoke something called the continuity equation. Imagine 
we have a volume (V) which is filled with some charge (Q). It is fairly 
obvious that if we add up all of the current density which is flowing out of 
the volume that it must be equal to the time rate of decrease of the charge 
within that volume. This ideas is expressed in the formula below which uses 
a closed-surface integral, along with the all the other integrals to follow: 
Equation: 


We can write @ as, [link], where we are doing a volume integral of the 
charge density (p ) over the volume (V). Now we can use Gauss' theorem 
which says we can replace a surface integral of a quantity with a volume 
integral of its divergence, [link]. 

Equation: 
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Equation: 


So, combining [link], [link] and [link], we have, [link]. 
Equation: 
dp 
div (J) dV =— | —dV 
/ iv (J) / dt 


Finally, we let the volume V shrink down to a point, which means the 
quantities inside the integral must be equal, and we have the differential 
form of the continuity equation (in one dimension), [Link]. 

Equation: 


div (J) os 


What about the electrons? 


Now let's go back to the electrons in the diode. The electrons which have 
been injected across the junction are called excess minority carriers, 
because they are electrons in a p-region (hence minority) but their 
concentration is greater than what they would be if they were in a sample of 
p-type material at equilibrium. We will designate them as n', and since they 
could change with both time and position we shall write them as n'(x,¢). 
Now there are two ways in which n'(x,t) can change with time. One would 
be if we were to stop injecting electrons in from the n-side of the junction. 
A reasonable way to account for the decay which would occur if we were 
not supplying electrons would be to write: 

Equation: 


Where 7, called the minority carrier recombination lifetime. It is pretty easy 
to show that if we start out with an excess minority carrier concentration no' 
at t = 0, then n'(x,t) will go as, [link]. But, the electron concentration can 
also change because of electrons flowing into or out of the region x. The 


p(x,t) 
q 


electron concentration n'(x,t) is just . Thus, due to electron flow we 


have, [Link]. 


Equation: 
n'(x,t) = n'jem 
Equation: 
n'(z,t) = a sole.t) 
= a div (J(z,t)) 


But, we can get an expression for J(, t) from [link]. Reducing the 
divergence in [link] to one dimension (we just have a oh) we finally end up 


with, [Link]. 
Equation: 


d? n'(z, t) 
d x? 


Combining [link] and [link] (electrons will, after all, suffer from both 
recombination and diffusion) and we end up with: 
Equation: 


d? n'(z, t) n' (a, t) 
/ ? ’ 
—n (x,t) = De = 5 = : 


This is a somewhat specialized form of an equation called the ambipolar 
diffusion equation. It seems kind of complicated but we can get some nice 
results from it if we make some simply boundary condition assumptions. 


Using the ambipolar diffusion equation 


For anything we will be interested in, we will only look at steady state 
solutions. This means that the time derivative on the LHS of [link] is zero, 
and so letting n’(x,t) become simply n‘(a) since we no longer have any 
time variation to worry about, we have: 

Equation: 


d? 1 
Day 


n'(x) =0 


Picking the not unreasonable boundary conditions that n’(0) = no (the 
concentration of excess electrons just at the start of the diffusion region) 
and n'(a) —> 0. as 2 — oo (the excess carriers go to zero when we get far 
from the junction) then: 

Equation: 


The expression in the radical ./ D.7T, is called the electron diffusion length, 
L,, and gives us some idea as to how far away from the junction the excess 
electrons will exist before they have more or less all recombined. This will 
be important for us when we move on to bipolar transistors. A typical value 
for the diffusion coefficient for electrons in silicon would be D, = 25 
cm?/sec and the minority carrier lifetime is usually around a microsecond. 
As shown in [link] this is not very far at all. 

Equation: 
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Crystal Structure 


Introduction 


In any sort of discussion of crystalline materials, it is useful to begin with a 
discussion of crystallography: the study of the formation, structure, and 
properties of crystals. A crystal structure is defined as the particular 
repeating arrangement of atoms (molecules or ions) throughout a crystal. 
Structure refers to the internal arrangement of particles and not the external 
appearance of the crystal. However, these are not entirely independent since 
the external appearance of a crystal is often related to the internal 
arrangement. For example, crystals of cubic rock salt (NaCl) are physically 
cubic in appearance. Only a few of the possible crystal structures are of 
concern with respect to simple inorganic salts and these will be discussed in 
detail, however, it is important to understand the nomenclature of 
crystallography. 


Crystallography 


Bravais lattice 


The Bravais lattice is the basic building block from which all crystals can 
be constructed. The concept originated as a topological problem of finding 
the number of different ways to arrange points in space where each point 
would have an identical “atmosphere”. That is each point would be 
surrounded by an identical set of points as any other point, so that all points 
would be indistinguishable from each other. Mathematician Auguste 
Bravais discovered that there were 14 different collections of the groups of 
points, which are known as Bravais lattices. These lattices fall into seven 
different "crystal systems”, as differentiated by the relationship between the 
angles between sides of the “unit cell” and the distance between points in 
the unit cell. The unit cell is the smallest group of atoms, ions or molecules 
that, when repeated at regular intervals in three dimensions, will produce 
the lattice of a crystal system. The “lattice parameter” is the length between 
two points on the comers of a unit cell. Each of the various lattice 
parameters are designated by the letters a, b, and c. If two sides are equal, 


such as in a tetragonal lattice, then the lengths of the two lattice parameters 
are designated a and c, with b omitted. The angles are designated by the 
Greek letters a, B, and y, such that an angle with a specific Greek letter is 
not subtended by the axis with its Roman equivalent. For example, a is the 
included angle between the b and c axis. 


[link] shows the various crystal systems, while [link] shows the 14 Bravais 
lattices. It is important to distinguish the characteristics of each of the 
individual systems. An example of a material that takes on each of the 
Bravais lattices is shown in [link]. 


System Axial lengths and angles Parcel 
geometry 

cubic a=b=c,a= B= y= 90° 

tetragonal a=b#c,a=fB=~7y=90° 


orthorhombic a#b#c,a= 68 = y= 90° 


rhombohedral a=b=c,a=B=y77#90° 


a=b4#c,a=B=90°, y= 


h ] 
exagona 120° 
monoclinic ae ARE 
90° 
triclinic a#%~b#c,azpFy 


Geometrical characteristics of the seven crystal systems. 


simple cubic body-centered face-centered 
cubic cubic 


= 
C 


le 
| 
-» 
i 
simple body-centered 
tetragonal tetragonal 


simple body-centered 
orthorhombic orthorhombic 


base-centered face-centered 
orthorhombic orthorhombic 


rhombohedral hexagonal 
simple base-centered triclinic 
monoclinic monoclinic 


Bravais lattices. 


Crystal system Example 


triclinic K S208 


monoclinic As,S4, KNO> 
rhombohedral Hg, Sb 
hexagonal Zn, Co, NiAs 
orthorhombic Ga, Fe3C 
tetragonal In, TiO 
cubic Au, Si, NaCl 


Examples of elements and compounds that adopt each of the crystal 
systems. 


The cubic lattice is the most symmetrical of the systems. All the angles are 
equal to 90°, and all the sides are of the same length (a = b = c). Only the 
length of one of the sides (a) is required to describe this system completely. 
In addition to simple cubic, the cubic lattice also includes body-centered 
cubic and face-centered cubic ([{link]). Body-centered cubic results from the 
presence of an atom (or ion) in the center of a cube, in addition to the atoms 
(ions) positioned at the vertices of the cube. In a similar manner, a face- 
centered cubic requires, in addition to the atoms (ions) positioned at the 
vertices of the cube, the presence of atoms (ions) in the center of each of the 
cubes face. 


The tetragonal lattice has all of its angles equal to 90°, and has two out of 
the three sides of equal length (a = b). The system also includes body- 
centered tetragonal ([link]). 


In an orthorhombic lattice all of the angles are equal to 90°, while all of its 
sides are of unequal length. The system needs only to be described by three 
lattice parameters. This system also includes body-centered orthorhombic, 
base-centered orthorhombic, and face-centered orthorhombic ([link]). A 
base-centered lattice has, in addition to the atoms (ions) positioned at the 


vertices of the orthorhombic lattice, atoms (ions) positioned on just two 
opposing faces. 


The rhombohedral lattice is also known as trigonal, and has no angles equal 
to 90°, but all sides are of equal length (a = b = c), thus requiring only by 
one lattice parameter, and all three angles are equal (a = B = 4). 


A hexagonal crystal structure has two angles equal to 90°, with the other 
angle ( y) equal to 120°. For this to happen, the two sides surrounding the 
120° angle must be equal (a = b), while the third side (c) is at 90° to the 
other sides and can be of any length. 


The monoclinic lattice has no sides of equal length, but two of the angles 
are equal to 90°, with the other angle (usually defined as B) being 
something other than 90°. It is a tilted parallelogram prism with rectangular 
bases. This system also includes base-centered monoclinic ([link]). 


In the triclinic lattice none of the sides of the unit cell are equal, and none of 
the angles within the unit cell are equal to 90°. The triclinic lattice is chosen 
such that all the internal angles are either acute or obtuse. This crystal 
system has the lowest symmetry and must be described by 3 lattice 
parameters (a, b, and c) and the 3 angles (a, B, and ¥). 


Atom positions, crystal directions and Miller indices 


Atom positions and crystal axes 


The structure of a crystal is defined with respect to a unit cell. As the entire 
crystal consists of repeating unit cells, this definition is sufficient to 
represent the entire crystal. Within the unit cell, the atomic arrangement is 
expressed using coordinates. There are two systems of coordinates 
commonly in use, which can cause some confusion. Both use a corner of 
the unit cell as their origin. The first, less-commonly seen system is that of 
Cartesian or orthogonal coordinates (X, Y, Z). These usually have the units 
of Angstroms and relate to the distance in each direction between the origin 


of the cell and the atom. These coordinates may be manipulated in the same 
fashion are used with two- or three-dimensional graphs. It is very simple, 
therefore, to calculate inter-atomic distances and angles given the Cartesian 
coordinates of the atoms. Unfortunately, the repeating nature of a crystal 
cannot be expressed easily using such coordinates. For example, consider a 
cubic cell of dimension 3.52 A. Pretend that this cell contains an atom that 
has the coordinates (1.5, 2.1, 2.4). That is, the atom is 1.5 A away from the 
origin in the x direction (which coincides with the a cell axis), 2.1 A in the 
y (which coincides with the b cell axis) and 2.4 A in the z (which coincides 
with the c cell axis). There will be an equivalent atom in the next unit cell 
along the x-direction, which will have the coordinates (1.5 + 3.52, 2.1, 2.4) 
or (5.02, 2.1, 2.4). This was a rather simple calculation, as the cell has very 
high symmetry and so the cell axes, a, b and c, coincide with the Cartesian 
axes, X, Y and Z. However, consider lower symmetry cells such as triclinic 
or monoclinic in which the cell axes are not mutually orthogonal. In such 
cases, expressing the repeating nature of the crystal is much more difficult 
to accomplish. 


Accordingly, atomic coordinates are usually expressed in terms of fractional 
coordinates, (x, y, z). This coordinate system is coincident with the cell axes 
(a, b, c) and relates to the position of the atom in terms of the fraction along 
each axis. Consider the atom in the cubic cell discussion above. The atom 
was 1.5 A in the a direction away from the origin. As the a axis is 3.52 A 
long, the atom is (17/352) or 0.43 of the axis away from the origin. 
Similarly, it is (*/3.59) or 0.60 of the b axis and (74/3) or 0.68 of the c axis. 
The fractional coordinates of this atom are, therefore, (0.43, 0.60, 0.68). 
The coordinates of the equivalent atom in the next cell over in the a 
direction, however, are easily calculated as this atom is simply 1 unit cell 
away ina. Thus, all one has to do is add 1 to the x coordinate: (1.43, 0.60, 
0.68). Such transformations can be performed regardless of the shape of the 
unit cell. Fractional coordinates, therefore, are used to retain and manipulate 
crystal information. 


Crystal directions 


The designation of the individual vectors within any given crystal lattice is 
accomplished by the use of whole number multipliers of the lattice 
parameter of the point at which the vector exits the unit cell. The vector is 
indicated by the notation [hkl], where h, k, and ! are reciprocals of the point 
at which the vector exits the unit cell. The origination of all vectors is 
assumed defined as [000]. For example, the direction along the a-axis 
according to this scheme would be [100] because this has a component only 
in the a-direction and no component along either the b or c axial direction. 
A vector diagonally along the face defined by the a and b axis would be 
[110], while going from one corner of the unit cell to the opposite corner 
would be in the [111] direction. [link] shows some examples of the various 
directions in the unit cell. The crystal direction notation is made up of the 
lowest combination of integers and represents unit distances rather than 
actual distances. A [222] direction is identical to a [111], so [111] is used. 
Fractions are not used. For example, a vector that intercepts the center of 
the top face of the unit cell has the coordinates x = 1/2, y = 1/2,z = 1. All 
have to be inversed to convert to the lowest combination of integers (whole 
numbers); i.e., [221] in [link]. Finally, all parallel vectors have the same 
crystal direction, e.g., the four vertical edges of the cell shown in [link] all 
have the crystal direction [hk/] = [001]. 


Some common 
directions in a 
cubic unit cell. 


Crystal directions may be grouped in families. To avoid confusion there 
exists a convention in the choice of brackets surrounding the three numbers 
to differentiate a crystal direction from a family of direction. For a 
direction, square brackets [hkl] are used to indicate an individual direction. 
Angle brackets <hkl> indicate a family of directions. A family of directions 
includes any directions that are equivalent in length and types of atoms 
encountered. For example, in a cubic lattice, the [100], [010], and [001] 
directions all belong to the <100> family of planes because they are 
equivalent. If the cubic lattice were rotated 90°, the a, b, and c directions 
would remain indistinguishable, and there would be no way of telling on 
which crystallographic positions the atoms are situated, so the family of 
directions is the same. In a hexagonal crystal, however, this is not the case, 
so the [100] and [010] would both be <100> directions, but the [001] 
direction would be distinct. Finally, negative directions are identified with a 
bar over the negative number instead of a minus sign. 


Crystal planes 


Planes in a crystal can be specified using a notation called Miller indices. 
The Miller index is indicated by the notation [hkl] where h, k, and | are 
reciprocals of the plane with the x, y, and z axes. To obtain the Miller 
indices of a given plane requires the following steps: 


The plane in question is placed on a unit cell. 

Its intercepts with each of the crystal axes are then found. 

The reciprocal of the intercepts are taken. 

These are multiplied by a scalar to insure that is in the simple ratio of whole 
numbers. 


For example, the face of a lattice that does not intersect the y or z axis 
would be (100), while a plane along the body diagonal would be the (111) 
plane. An illustration of this along with the (111) and (110) planes is given 
in [Link]. 


1 _ 
Tepe = M10) 


Examples of Miller indices 
notation for crystal planes. 


As with crystal directions, Miller indices directions may be grouped in 
families. Individual Miller indices are given in parentheses (hkl), while 
braces {hkl} are placed around the indices of a family of planes. For 
example, (001), (100), and (010) are all in the {100} family of planes, for a 
cubic lattice. 


Description of crystal structures 


Crystal structures may be described in a number of ways. The most 
common manner is to refer to the size and shape of the unit cell and the 
positions of the atoms (or ions) within the cell. However, this information is 
sometimes insufficient to allow for an understanding of the true structure in 
three dimensions. Consideration of several unit cells, the arrangement of the 


atoms with respect to each other, the number of other atoms they in contact 
with, and the distances to neighboring atoms, often will provide a better 
understanding. A number of methods are available to describe extended 
solid-state structures. The most applicable with regard to elemental and 
compound semiconductor, metals and the majority of insulators is the close 
packing approach. 


Close packed structures: hexagonal close packing and cubic close 
packing 


Many crystal structures can be described using the concept of close 
packing. This concept requires that the atoms (ions) are arranged so as to 
have the maximum density. In order to understand close packing in three 
dimensions, the most efficient way for equal sized spheres to be packed in 
two dimensions must be considered. 


The most efficient way for equal sized spheres to be packed in two 
dimensions is shown in [link], in which it can be seen that each sphere (the 
dark gray shaded sphere) is surrounded by, and is in contact with, six other 
spheres (the light gray spheres in [link]). It should be noted that contact 
with six other spheres the maximum possible is the spheres are the same 
size, although lower density packing is possible. Close packed layers are 
formed by repetition to an infinite sheet. Within these close packed layers, 
three close packed rows are present, shown by the dashed lines in [link]. 


Schematic representation of a 
close packed layer of equal 
sized spheres. The close packed 
rows (directions) are shown by 
the dashed lines. 


The most efficient way for equal sized spheres to be packed in three 
dimensions is to stack close packed layers on top of each other to give a 
close packed structure. There are two simple ways in which this can be 
done, resulting in either a hexagonal or cubic close packed structures. 


Hexagonal close packed 


If two close packed layers A and B are placed in contact with each other so 


as to maximize the density, then the spheres of layer B will rest in the 
hollow (vacancy) between three of the spheres in layer A. This is 
demonstrated in [link]. Atoms in the second layer, B (shaded light gray), 
may occupy one of two possible positions ([link]a or b) but not both 
together or a mixture of each. If a third layer is placed on top of layer B 
such that it exactly covers layer A, subsequent placement of layers will 


result in the following sequence ...ABABAB.... This is known as hexagonal 


close packing or hcp. 


(a) (b) 


Schematic representation of two close packed 
layers arranged in A (dark grey) and B (light grey) 
positions. The alternative stacking of the B layer is 

shown in (a) and (b). 


The hexagonal close packed cell is a derivative of the hexagonal Bravais 
lattice system ({link]) with the addition of an atom inside the unit cell at the 
coordinates (1/3,7/3,'/9). The basal plane of the unit cell coincides with the 
close packed layers ({link]). In other words the close packed layer makes-up 
the {001} family of crystal planes. 


A schematic 
projection of the 
basal plane of the 
hep unit cell on the 
close packed 
layers. 


The “packing fraction” in a hexagonal close packed cell is 74.05%; that is 
74.05% of the total volume is occupied. The packing fraction or density is 
derived by assuming that each atom is a hard sphere in contact with its 
nearest neighbors. Determination of the packing fraction is accomplished 
by calculating the number of whole spheres per unit cell (2 in hcp), the 
volume occupied by these spheres, and a comparison with the total volume 
of a unit cell. The number gives an idea of how “open” or filled a structure 
is. By comparison, the packing fraction for body-centered cubic ({link]) is 
68% and for diamond cubic (an important semiconductor structure to be 
described later) is it 34%. 


Cubic close packed: face-centered cubic 


In a similar manner to the generation of the hexagonal close packed 
structure, two close packed layers are stacked ([link]) however, the third 
layer (C) is placed such that it does not exactly cover layer A, while sitting 
in a set of troughs in layer B ([link]), then upon repetition the packing 
sequence will be .. ABCABCABC.... This is known as cubic close packing 
or ccp. 


Schematic representation of the 
three close packed layers in a 
cubic close packed 
arrangement: A (dark grey), B 


(medium grey), and C (light 
grey). 


The unit cell of cubic close packed structure is actually that of a face- 
centered cubic (fcc) Bravais lattice. In the fcc lattice the close packed layers 
constitute the {111} planes. As with the hcp lattice packing fraction in a 
cubic close packed (fcc) cell is 74.05%. Since face centered cubic or fcc is 
more commonly used in preference to cubic close packed (ccp) in 
describing the structures, the former will be used throughout this text. 


Coordination number 


The coordination number of an atom or ion within an extended structure is 
defined as the number of nearest neighbor atoms (ions of opposite charge) 
that are in contact with it. A slightly different definition is often used for 
atoms within individual molecules: the number of donor atoms associated 
with the central atom or ion. However, this distinction is rather artificial, 
and both can be employed. 


The coordination numbers for metal atoms in a molecule or complex are 
commonly 4, 5, and 6, but all values from 2 to 9 are known and a few 
examples of higher coordination numbers have been reported. In contrast, 
common coordination numbers in the solid state are 3, 4, 6, 8, and 12. For 
example, the atom in the center of body-centered cubic lattice has a 
coordination number of 8, because it touches the eight atoms at the corners 
of the unit cell, while an atom in a simple cubic structure would have a 
coordination number of 6. In both fcc and hcp lattices each of the atoms 
have a coordination number of 12. 


Octahedral and tetrahedral vacancies 


As was mentioned above, the packing fraction in both fcc and hcp cells is 
74.05%, leaving 25.95% of the volume unfilled. The unfilled lattice sites 
(interstices) between the atoms in a cell are called interstitial sites or 
vacancies. The shape and relative size of these sites is important in 
controlling the position of additional atoms. In both fcc and hcp cells most 
of the space within these atoms lies within two different sites known as 
octahedral sites and tetrahedral sites. The difference between the two lies in 
their “coordination number”, or the number of atoms surrounding each site. 
Tetrahedral sites (vacancies) are surrounded by four atoms arranged at the 
comers of a tetrahedron. Similarly, octahedral sites are surrounded by six 
atoms which make-up the apices of an octahedron. For a given close packed 
lattice an octahedral vacancy will be larger than a tetrahedral vacancy. 


Within a face centered cubic lattice, the eight tetrahedral sites are positioned 
within the cell, at the general fractional coordinate of (°/4,"/4,"/4) where n = 
1 or 3, e.g., (7/4, /4,!/4), (/4,"/4,7/4), etc. The octahedral sites are located at 
the center of the unit cell (1/5,"/5,'/5), as well as at each of the edges of the 
cell, e.g., (4/5,0,0). In the hexagonal close packed system, the tetrahedral 
sites are at (0,0,°/g) and (1/3,/3,’/g), and the octahedral sites are at 
(1/3,"/3,'/4) and all symmetry equivalent positions. 


Important structure types 


The majority of crystalline materials do not have a structure that fits into the 
one atom per site simple Bravais lattice. A number of other important 
crystal structures are found, however, only a few of these crystal structures 
are those of which occur for the elemental and compound semiconductors 
and the majority of these are derived from fcc or hcp lattices. Each 
structural type is generally defined by an archetype, a material (often a 
naturally occurring mineral) which has the structure in question and to 
which all the similar materials are related. With regard to commonly used 
elemental and compound semiconductors the important structures are 
diamond, zinc blende, Wurtzite, and to a lesser extent chalcopyrite. 
However, rock salt, B-tin, cinnabar and cesium chloride are observed as 
high pressure or high temperature phases and are therefore also discussed. 


The following provides a summary of these structures. Details of the full 
range of solid-state structures are given elsewhere. 


Diamond Cubic 


The diamond cubic structure consists of two interpenetrating face-centered 
cubic lattices, with one offset '/, of a cube along the cube diagonal. It may 
also be described as face centered cubic lattice in which half of the 
tetrahedral sites are filled while all the octahedral sites remain vacant. The 
diamond cubic unit cell is shown in [link]. Each of the atoms (e.g., C) is 
four coordinate, and the shortest interatomic distance (C-C) may be 
determined from the unit cell parameter (a). 

Equation: 


Unit cell structure of a 
diamond cubic lattice 
showing the two 
interpenetrating face- 
centered cubic lattices. 


Zinc blende 


This is a binary phase (ME) and is named after its archetype, a common 
mineral form of zinc sulfide (ZnS). As with the diamond lattice, zinc blende 
consists of the two interpenetrating fcc lattices. However, in zinc blende one 
lattice consists of one of the types of atoms (Zn in ZnS), and the other 
lattice is of the second type of atom (S in ZnS). It may also be described as 
face centered cubic lattice of S atoms in which half of the tetrahedral sites 
are filled with Zn atoms. All the atoms in a zinc blende structure are 4- 
coordinate. The zinc blende unit cell is shown in [link]. A number of inter- 
atomic distances may be calculated for any material with a zinc blende unit 
cell using the lattice parameter (a). 

Equation: 


Zn-S = av3 = 0.422a 
4 


Equation: 


Zn-Zn = S-S = a_= 0.707 a 
v2 


Unit cell structure of a 


zinc blende (ZnS) lattice. 
Zinc atoms are shown in 
green (small), sulfur 
atoms shown in red 
(large), and the dashed 
lines show the unit cell. 


Chalcopyrite 


The mineral chalcopyrite CuFeS, is the archetype of this structure. The 
structure is tetragonal (a = b#c, a= 8 = y = 90°, and is essentially a 
superlattice on that of zinc blende. Thus, is easiest to imagine that the 
chalcopyrite lattice is made-up of a lattice of sulfur atoms in which the 
tetrahedral sites are filled in layers, ...FeCuCuFe..., etc. ({link]). In such an 
idealized structure c = 2a, however, this is not true of all materials with 
chalcopyrite structures. 


Unit cell structure of a 
chalcopyrite lattice. 
Copper atoms are shown 
in blue, iron atoms are 
shown in green and sulfur 
atoms are shown in 
yellow. The dashed lines 
show the unit cell. 


Rock salt 


As its name implies the archetypal rock salt structure is NaCl (table salt). In 
common with the zinc blende structure, rock salt consists of two 
interpenetrating face-centered cubic lattices. However, the second lattice is 
offset 1/2a along the unit cell axis. It may also be described as face centered 
cubic lattice in which all of the octahedral sites are filled, while all the 
tetrahedral sites remain vacant, and thus each of the atoms in the rock salt 


structure are 6-coordinate. The rock salt unit cell is shown in [link]. A 
number of inter-atomic distances may be calculated for any material with a 
rock salt structure using the lattice parameter (a). 

Equation: 


Na-Cl = a = 05a 
2 


Equation: 


Na-Na = CI-Cl = a = 0.707a 
v2 


Unit cell structure of a 
rock salt lattice. Sodium 
ions are shown in purple 

(small spheres) and 
chloride ions are shown 
in red (large spheres). 


Cinnabar 


Cinnabar, named after the archetype mercury sulfide, Hg§S, is a distorted 
rock salt structure in which the resulting cell is rhombohedral (trigonal) 
with each atom having a coordination number of six. 


Wurtzite 


This is a hexagonal form of the zinc sulfide. It is identical in the number of 
and types of atoms, but it is built from two interpenetrating hcp lattices as 
opposed to the fcc lattices in zinc blende. As with zinc blende all the atoms 
in a wurtzite structure are 4-coordinate. The wurtzite unit cell is shown in 
[link]. A number of inter atomic distances may be calculated for any 
material with a wurtzite cell using the lattice parameter (a). 

Equation: 


Zn-S = av3/8 = 0.612a = 3c = 0375c¢ 
8 


Equation: 


Zn-Zn = S-S = a = 1.632c 


However, it should be noted that these formulae do not necessarily apply 
when the ratio a/c is different from the ideal value of 1.632. 


Unit cell structure of a 
wurtzite lattice. Zinc 
atoms are shown in green 
(small spheres), sulfur 
atoms shown in red (large 
spheres), and the dashed 
lines show the unit cell. 


Cesium Chloride 


The cesium chloride structure is found in materials with large cations and 
relatively small anions. It has a simple (primitive) cubic cell ([link]) with a 
chloride ion at the corners of the cube and the cesium ion at the body center. 
The coordination numbers of both Cs* and Cl’, with the inner atomic 
distances determined from the cell lattice constant (a). 

Equation: 


Cs-Cl = ayv3 = 0.8664 
2 


Equation: 


Cs-Cs = CI-Cl =a 


B-Tin. 


The room temperature allotrope of tin is B-tin or white tin. It has a 
tetragonal structure, in which each tin atom has four nearest neighbors (Sn- 
Sn = 3.016 A) arranged in a very flattened tetrahedron, and two next nearest 
neighbors (Sn-Sn = 3.175 A). The overall structure of B-tin consists of 
fused hexagons, each being linked to its neighbor via a four-membered Sn, 
ring. 


Defects in crystalline solids 


Up to this point we have only been concerned with ideal structures for 
crystalline solids in which each atom occupies a designated point in the 
crystal lattice. Unfortunately, defects ordinarily exist in equilibrium 
between the crystal lattice and its environment. These defects are of two 
general types: point defects and extended defects. As their names imply, 
point defects are associated with a single crystal lattice site, while extended 
defects occur over a greater range. 


Point defects: “too many or too few” or “just plain wrong” 


Point defects have a significant effect on the properties of a semiconductor, 
so it is important to understand the classes of point defects and the 
characteristics of each type. [link] summarizes various classes of native 
point defects, however, they may be divided into two general classes; 
defects with the wrong number of atoms (deficiency or surplus) and defects 
where the identity of the atoms is incorrect. 


: 
- 


(a) perfect lattice (b) interstitial impurity 


: 


(c) cation vacancy (d) anion vacancy 


(e) substitution of cation (f) substitution of anion 


(g) Ba antisite defect (h) Ag antisite defect 


Point defects in a crystal lattice. 


Interstitial Impurity 


An interstitial impurity occurs when an extra atom is positioned in a lattice 
site that should be vacant in an ideal structure ([{link]b). Since all the 
adjacent lattice sites are filled the additional atom will have to squeeze itself 
into the interstitial site, resulting in distortion of the lattice and alteration in 
the local electronic behavior of the structure. Small atoms, such as carbon, 


will prefer to occupy these interstitial sites. Interstitial impurities readily 
diffuse through the lattice via interstitial diffusion, which can result in a 
change of the properties of a material as a function of time. Oxygen 
impurities in silicon generally are located as interstitials. 


Vacancies 


The converse of an interstitial impurity is when there are not enough atoms 
in a particular area of the lattice. These are called vacancies. Vacancies exist 
in any material above absolute zero and increase in concentration with 
temperature. In the case of compound semiconductors, vacancies can be 
either cation vacancies ({link]c) or anion vacancies ([{link]d), depending on 
what type of atom are “missing”. 


Substitution 


Substitution of various atoms into the normal lattice structure is common, 
and used to change the electronic properties of both compound and 
elemental semiconductors. Any impurity element that is incorporated 
during crystal growth can occupy a lattice site. Depending on the impurity, 
substitution defects can greatly distort the lattice and/or alter the electronic 
structure. In general, cations will try to occupy cation lattice sites ([link]e), 
and anion will occupy the anion site ({link]f). For example, a zinc impurity 
in GaAs will occupy a gallium site, if possible, while a sulfur, selenium and 
tellurium atoms would all try to substitute for an arsenic. Some impurities 
will occupy either site indiscriminately, e.g., Si and Sn occupy both Ga and 
As sites in GaAs. 


Antisite Defects 


Antisite defects are a particular form of substitution defect, and are unique 
to compound semiconductors. An antisite defect occurs when a cation is 
misplaced on an anion lattice site or vice versa ([link]g and h). Dependant 


on the arrangement these are designated as either Ap antisite defects or Ba 
antisite defects. For example, if an arsenic atom is on a gallium lattice site 
the defect would be an Asc, defect. Antisite defects involve fitting into a 
lattice site atoms of a different size than the rest of the lattice, and therefore 
this often results in a localized distortion of the lattice. In addition, cations 
and anions will have a different number of electrons in their valence shells, 
so this substitution will alter the local electron concentration and the 
electronic properties of this area of the semiconductor. 


Extended Defects: Dislocations in a Crystal Lattice 


Extended defects may be created either during crystal growth or as a 
consequence of stress in the crystal lattice. The plastic deformation of 
crystalline solids does not occur such that all bonds along a plane are 
broken and reformed simultaneously. Instead, the deformation occurs 
through a dislocation in the crystal lattice. [link] shows a schematic 
representation of a dislocation in a crystal lattice. Two features of this type 
of dislocation are the presence of an extra crystal plane, and a large void at 
the dislocation core. Impurities tend to segregate to the dislocation core in 
order to relieve strain from their presence. 


extra net plane 
0 . direction of slip 
—_____ >» 


dislocation 
core 


Dislocation in a crystal lattice. 


Epitaxy 


Epitaxy, is a transliteration of two Greek words epi, meaning "upon", and 
taxis, meaning "ordered". With respect to crystal growth it applies to the 
process of growing thin crystalline layers on a crystal substrate. In epitaxial 
growth, there is a precise crystal orientation of the film in relation to the 
substrate. The growth of epitaxial films can be done by a number of 
methods including molecular beam epitaxy, atomic layer epitaxy, and 
chemical vapor deposition, all of which will be described later. 


Epitaxy of the same material, such as a gallium arsenide film on a gallium 
arsenide substrate, is called homoepitaxy, while epitaxy where the film and 
substrate material are different is called heteroepitaxy. Clearly, in 
homoepitaxy, the substrate and film will have the identical structure, 
however, in heteroepitaxy, it is important to employ where possible a 
substrate with the same structure and similar lattice parameters. For 
example, zinc selenide (zinc blende, a = 5.668 A) is readily grown on 
gallium arsenide (zinc blende, a = 5.653 A). Alternatively, epitaxial crystal 
growth can occur where there exists a simple relationship between the 
structures of the substrate and crystal layer, such as is observed between 
AlyO3 (100) on Si (100). Whichever route is chosen a close match in the 
lattice parameters is required, otherwise, the strains induced by the lattice 
mismatch results in distortion of the film and formation of dislocations. If 
the mismatch is significant epitaxial growth is not energetically favorable, 
causing a textured film or polycrystalline untextured film to be grown. As a 
general rule of thumb, epitaxy can be achieved if the lattice parameters of 
the two materials are within about 5% of each other. For good quality 
epitaxy, this should be less than 1%. The larger the mismatch, the larger the 
strain in the film. As the film gets thicker and thicker, it will try to relieve 
the strain in the film, which could include the loss of epitaxy of the growth 
of dislocations. It is important to note that the <100> directions of a film 
must be parallel to the <100> direction of the substrate. In some cases, such 
as Fe on MgO, the [111] direction is parallel to the substrate [100]. The 
epitaxial relationship is specified by giving first the plane in the film that is 
parallel to the substrate [100]. 
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Structures of Element and Compound Semiconductors 


Introduction 


A single crystal of either an elemental (e.g., silicon) or compound (e.g., 
gallium arsenide) semiconductor forms the basis of almost all 
semiconductor devices. The ability to control the electronic and opto- 
electronic properties of these materials is based on an understanding of their 
structure. In addition, the metals and many of the insulators employed 
within a microelectronic device are also crystalline. 


Group IV (14) elements 


Each of the semiconducting phases of the group IV (14) elements, C 
(diamond), Si, Ge, and a-Sn, adopt the diamond cubic structure ((link]). 
Their lattice constants (a, A) and densities (p, g/cm?) are given in [link]. 


Unit cell structure of a 
diamond cubic lattice 
showing the two 
interpenetrating face- 
centered cubic lattices. 


Lattice parameter, a 


Element Density (g/cm? 
carbon 

(arsond) 3.56683(1) 3.51525 

silicon 5.4310201(3) 2.319002 
germanium 5.657906(1) 5.3234 

tin (a-Sn) 6.4892(1) 7.285 


Lattice parameters and densities (measured at 298 K) for the diamond cubic 
forms of the group IV (14) elements. 


As would be expected the lattice parameter increase in the order C < Si < 
Ge < a-Sn. Silicon and germanium form a continuous series of solid 
solutions with gradually varying parameters. It is worth noting the high 
degree of accuracy that the lattice parameters are known for high purity 
crystals of these elements. In addition, it is important to note the 
temperature at which structural measurements are made, since the lattice 
parameters are temperature dependent ([link]). The lattice constant (a), in 
A, for high purity silicon may be calculated for any temperature (T) over 
the temperature range 293 - 1073 K by the formula shown below. 


ay = 5.4304 + 1.8138 X 10° (T - 298.15 K) + 1.542 X 10°9 (T — 298.15 K) 


(a) 5.447 
5.444 

5.44] 

aA) 5 a3 
5.435 


5.432 


5.429 
0 100 200 300 400 500 600 700 800 


Temperature (°C) 


(b) 5.69 


5.66 


0 100 200 300 400 500 600 700 800 
Temperature (°C) 


Temperature dependence of the 
lattice parameter for (a) Si and 
(b) Ge. 


Even though the diamond cubic forms of Si and Ge are the only forms of 
direct interest to semiconductor devices, each exists in numerous crystalline 
high pressure and meta-stable forms. These are described along with their 
interconversions, in [link]. 


Phase 


Sil 


Si II 


Si III 


Si TV 


Si V 


Si VI 


Ge I 


Ge II 


Ge III 


Ge IV 


Structure 
diamond cubic 


grey tin 
structure 


cubic 


hexagonal 


unidentified 


hexagonal 
close packed 


diamond cubic 


B-tin structure 


tetragonal 


body centered 
cubic 


Remarks 
stable at normal pressure 


formed from Si I or Si V above 14 
GPa 


metastable, formed from Si II above 
10 GPa 


stable above 34 GPa, formed from Si 
II above 16 GPa 


stable above 45 GPa 


low-pressure phase 
formed from Ge I above 10 GPa 


formed by quenching Ge II at low 
pressure 


formed by quenching Ge II to 1 atm 
at 200 K 


High pressure and metastable phases of silicon and germanium. 


Group ITI-V (13-15) compounds 


The stable phases for the arsenides, phosphides and antimonides of 
aluminum, gallium and indium all exhibit zinc blende structures ([link]). In 
contrast, the nitrides are found as wurtzite structures (e.g., [link]). The 
structure, lattice parameters, and densities of the III-V compounds are given 


in [link]. It is worth noting that contrary to expectation the lattice parameter 
of the gallium compounds is smaller than their aluminum homolog; for 
GaAs a = 5.653 A; AlAs a = 5.660 A. As with the group IV elements the 
lattice parameters are highly temperature dependent; however, additional 
variation arises from any deviation from absolute stoichiometry. These 
effects are shown in [link]. 


Unit cell structure of a 
zinc blende (ZnS) lattice. 
Zinc atoms are shown in 

green (small), sulfur 
atoms shown in red 

(large), and the dashed 
lines show the unit cell. 


Unit cell structure of a 
wurtzite lattice. Zinc 
atoms are shown in green 
(small), sulfur atoms 
shown in red (large), and 
the dashed lines show the 


unit cell. 
Compound Structure 
AIN wurtzite 
zinc 
og blende 
AlAs zinc 


blende 


Lattice 
parameter (A) 


a = 3.11(1), c= 
4.98(1) 

a = 5.4635(4) 
a = 5.660 


Density 
(g/cm?) 


3.200 


2.40(1) 


3.760 


AlSb zinc a = 6.1355(1) 4.26 


blende 
GaN wurtzite ae Ee 
GaP ae - a = 5.4505(2) 4.138 
GaAs ae a a = 5.65325(2) 5.3176(3) 
InN wurtzite ae oi 6.81 
InP a a a = 5.868(1) 4.81 
InAs he ” a = 6.0583 5.667 
InSb ae 7 a = 6.47937 5.7747(4) 


Lattice parameters and densities (measured at 298 K) for the II-V (13-15) 
compound semiconductors. Estimated standard deviations given in 
parentheses. 


stoichiometric 


a(A) 5. 


0 10 20 30 8 40 50 60 70 
Temperature (°C) 


Temperature dependence of the lattice 
parameter for stoichiometric GaAs 
and crystals with either Ga or As 
excess. 


The homogeneity of structures of alloys for a wide range of solid solutions 
to be formed between ITI-V compounds in almost any combination. Two 
classes of ternary alloys are formed: III,-II,_,-V (e.g., Al,-Ga,.,-As) and 
IT-V1_,-Vx (e.g., Ga-Asj_,-P,) . While quaternary alloys of the type III,- 
IIT,_,-V,-V1-y allow for the growth of materials with similar lattice 
parameters, but a broad range of band gaps. A very important ternary alloy, 
especially in optoelectronic applications, is Al,-Ga,_,-As and its lattice 
parameter (a) is directly related to the composition (x). 


d = 5.6533 + 0.0078 x 


Not all of the III-V compounds have well characterized high-pressure 
phases. however, in each case where a high-pressure phase is observed the 
coordination number of both the group III and group V element increases 
from four to six. Thus, AIP undergoes a zinc blende to rock salt 
transformation at high pressure above 170 kbar, while AlSb and GaAs form 
orthorhombic distorted rock salt structures above 77 and 172 kbar, 


respectively. An orthorhombic structure is proposed for the high-pressure 
form of InP (>133 kbar). Indium arsenide (InAs) undergoes two-phase 
transformations. The zinc blende structure is converted to a rock salt 
structure above 77 kbar, which in turn forms a B-tin structure above 170 
kbar. 


Group II-VI (12-16) compounds 


The structures of the II-VI compound semiconductors are less predictable 
than those of the III-V compounds (above), and while zinc blende structure 
exists for almost all of the compounds there is a stronger tendency towards 
the hexagonal wurtzite form. In several cases the zinc blende structure is 
observed under ambient conditions, but may be converted to the wurtzite 
form upon heating. In general the wurtzite form predominates with the 
smaller anions (e.g., oxides), while the zinc blende becomes the more stable 
phase for the larger anions (e.g., tellurides). One exception is mercury 
sulfide (HgS) that is the archetype for the trigonal cinnabar phase. [link] 
lists the stable phase of the chalcogenides of zinc, cadmium and mercury, 
along with their high temperature phases where applicable. Solid solutions 
of the II-VI compounds are not as easily formed as for the III-V 
compounds; however, two important examples are ZnS,Se,_, and Cd,Hg,. 
le: 


Lattice Density 
Compound Structure parameter (A) (g/cm?) 
ZINC = 
7nS Tae a=5.410 4.075 
wurtzite ae 087 


6.260 


ZnSe Zinc a = 5.668 oa 


blende 
Zinc - 
ZntTe eiende a= 6.10 5.636 
: a = 4.136, c = 
CdS wurtzite 6.714 4.82 
. a = 4.300, c = 
CdSe wurtzite 7011 5.81 
Zinc _ 
CdTe Pleada a = 6.482 5.87 
; a=4.149,c= 
Hgs cinnabar 9.495 
ane - a= 5.851 7.73 
Zinc _ 
HgSe hewis a = 6.085 8.25 
Zinc _ 
HgTe beade a = 6.46 8.07 


Lattice parameters and densities (measured at 298 K) for the II-VI (12-16) 
compound semiconductors. 


The zinc chalcogenides all transform to a cesium chloride structure under 
high pressures, while the cadmium compounds all form rock salt high- 
pressure phases ([link]). Mercury selenide (HgSe) and mercury telluride 
(HgTe) convert to the mercury sulfide archetype structure, cinnabar, at high 
pressure. 


Unit cell structure of a 
rock salt lattice. Sodium 
ions are shown in purple 

and chloride ions are 
shown in red. 


I-III-VI, (11-13-16) compounds 


Nearly all I-III-VI, compounds at room temperature adopt the chalcopyrite 
structure ([link]). The cell constants and densities are given in [link]. 
Although there are few reports of high temperature or high-pressure phases, 
AgInS> has been shown to exist as a high temperature orthorhombic 
polymorph (a = 6.954, b = 8.264, and c = 6.683 A), and AgInTe, forms a 
cubic phase at high pressures. 


Unit cell structure of a 
chalcopyrite lattice. 
Copper atoms are shown 
in blue, iron atoms are 
shown in green and sulfur 
atoms are shown in 
yellow. The dashed lines 
show the unit cell. 


Lattice Lattice 
Compound parameter a parameter c 


(A) (A) (g.cm’) 


Density 


CuAlS> 5.02 10.430 3.45 


CuAlSep 9.61 10.92 4.69 
CuAlTe, 5.96 177 9.47 
CuGaS» 9.39 10.46 4.38 
CuGaSe» 9.61 11.00 rel 
CuGatTep 6.00 11.93 9.95 
CulnS> Di02 11.08 4.74 
CulnSe, 5.78 11.55 Deld- 
CulnTe, 6.17 12.34 6.10 
AgAIS» 6.30 11.84 6.15 
AgGaS» DLO 10.29 4.70 
AgGaSe> 5.98 10.88 5.70 
AgGatTeo 6.29 11.95 6.08 
AgInS» 5.82 11.17 4.97 
AgInSe> 6.095 11.69 5.82 
AginTe> 6.43 12,09 6.96 


Chalcopyrite lattice parameters and densities (measured at 298 K) for the I- 
II-VI compound semiconductors. Lattice parameters for tetragonal cell. 


Of the I-ITI-VI, compounds, the copper indium chalcogenides (CuInE>) are 
certainly the most studied for their application in solar cells. One of the 


advantages of the copper indium chalcogenide compounds is the formation 
of solid solutions (alloys) of the formula CulnE>_,E',, where the 
composition variable (x) varies from 0 to 2. The CulnS5_,Se, and CulnSe >. 
x le, systems have also been examined, as has the CuGa,Inj.yS7_,Sex 
quaternary system. As would be expected from a consideration of the 
relative ionic radii of the chalcogenides the lattice parameters of the 
CulnS>_,Se, alloy should increase with increased selenium content. 
Vergard's law requires the lattice constant for a linear solution of two 
semiconductors to vary linearly with composition (e.g., as is observed for 
Al,Ga;_,As), however, the variation of the tetragonal lattice constants (a 
and c) with composition for CulnS>_,S, are best described by the parabolic 
relationships. 


a = 5.532 + 0.0801 x + 0.0260 x? 

c = 11.156 + 0.1204 x + 0.0611 x? 

A similar relationship is observed for the CulnSe_,Te, alloys. 
a = 5.783 + 0.1560 x + 0.0212 x? 

c = 11.628 + 0.3340 x + 0.0277 x? 


The large difference in ionic radii between S and Te (0.37 A) prevents 
formation of solid solutions in the CulnS»_,Te, system, however, the single 
alloy CulnS, 5Teg 5 has been reported. 


Orientation effects 


Once single crystals of high purity silicon or gallium arsenide are produced 
they are cut into wafers such that the exposed face of these wafers is either 
the crystallographic {100} or {111} planes. The relative structure of these 
surfaces are important with respect to oxidation, etching and thin film 
growth. These processes are orientation-sensitive; that is, they depend on 
the direction in which the crystal slice is cut. 


Atom density and dangling bonds 


The principle planes in a crystal may be differentiated in a number of ways, 
however, the atom and/or bond density are useful in predicting much of the 
chemistry of semiconductor surfaces. Since both silicon and gallium 
arsenide are fcc structures and the {100} and {111} are the only 
technologically relevant surfaces, discussions will be limited to fcc {100} 
and {111}. 


The atom density of a surface may be defined as the number of atoms per 
unit area. [link] shows a schematic view of the {111} and {100} planes in a 
fcc lattice. The {111} plane consists of a hexagonal close packed array in 
which the crystal directions within the plane are oriented at 60° to each 
other. The hexagonal packing and the orientation of the crystal directions 
are indicated in [link]b as an overlaid hexagon. Given the intra-planar inter- 
atomic distance may be defined as a function of the lattice parameter, the 
area of this hexagon may be readily calculated. For example in the case of 
silicon, the hexagon has an area of 38.30 A*. The number of atoms within 
the hexagon is three: the atom in the center plus 1/3 of each of the six atoms 
at the vertices of the hexagon (each of the atoms at the hexagons vertices is 
shared by three other adjacent hexagons). Thus, the atom density of the 
{111} plane is calculated to be 0.0783 A’. Similarly, the atom density of 
the {100} plane may be calculated. The {100} plane consists of a square 
array in which the crystal directions within the plane are oriented at 90° to 
each other. Since the square is coincident with one of the faces of the unit 
cell the area of the square may be readily calculated. For example in the 
case of silicon, the square has an area of 29.49 A*. The number of atoms 
within the square is 2: the atom in the center plus 1/4 of each of the four 
atoms at the vertices of the square (each of the atoms at the corners of the 
square are shared by four other adjacent squares). Thus, the atom density of 
the {100} plane is calculated to be 0.0678 A-*. While these values for the 
atom density are specific for silicon, their ratio is constant for all diamond 
cubic and zinc blende structures: {100}:{111} = 1:1.155. In general, the 
fewer dangling bonds the more stable a surface structure. 


Schematic representation of the (111) and 
(100) faces of a face centered cubic (fcc) 
lattice showing the relationship between the 
close packed rows. 


An atom inside a crystal of any material will have a coordination number 
(n) determined by the structure of the material. For example, all atoms 
within the bulk of a silicon crystal will be in a tetrahedral four-coordinate 
environment (n = 4). However, at the surface of a crystal the atoms will not 
make their full compliment of bonds. Each atom will therefore have less 
nearest neighbors than an atom within the bulk of the material. The missing 
bonds are commonly called dangling bonds. While this description is not 
particularly accurate it is, however, widely employed and as such will be 
used herein. The number of dangling bonds may be defined as the 
difference between the ideal coordination number (determined by the bulk 
crystal structure) and the actual coordination number as observed at the 
surface. 


[link] shows a section of the {111} surfaces of a diamond cubic lattice 
viewed perpendicular to the {111} plane. The atoms within the bulk have a 
coordination number of four. In contrast, the atoms at the surface (e.g., the 
atom shown in blue in [link]) are each bonded to just three other atoms (the 
atoms shown in red in [link]), thus each surface atom has one dangling 
bond. As can be seen from [link], which shows the atoms at the {100} 
surface viewed perpendicular to the {100} plane, each atom at the surface 
(e.g., the atom shown in blue in [link]) is only coordinated to two other 
atoms (the atoms shown in red in [link]), leaving two dangling bonds per 


atom. It should be noted that the same number of dangling bonds are found 
for the {111} and {100} planes of a zinc blende lattice. The ratio of 
dangling bonds for the {100} and {111} planes of all diamond cubic and 
zinc blende structures is {100}:{111} = 2:1. Furthermore, since the atom 
densities of each plane are known then the ratio of the dangling bond 
densities is determined to be: {100}:{111} = 1:0.577. 


A section of the {111} 
surfaces of a diamond 
cubic lattice viewed 
perpendicular to the 
{111} plane. 


A section of the {100} 
surface of a diamond 
cubic lattice viewed 
perpendicular to the 

{100} plane. 


Silicon 


For silicon, the {111} planes are closer packed than the {100} planes. As a 
result, growth of a silicon crystal is therefore slowest in the <111> 
direction, since it requires laying down a close packed atomic layer upon 
another layer in its closest packed form. As a consequence <111> Si is the 
easiest to grow, and therefore the least expensive. 


The dissolution or etching of a crystal is related to the number of broken 
bonds already present at the surface: the fewer bonds to be broken in order 
to remove an individual atom from a crystal, the easier it will be to dissolve 
the crystal. As a consequence of having only one dangling bond (requiring 
three bonds to be broken) etching silicon is slowest in the <111> direction. 
The electronic properties of a silicon wafer are also related to the number of 
dangling bonds. 


Silicon microcircuits are generally formed on a single crystal wafer that is 
diced after fabrication by either sawing part way through the wafer 
thickness or scoring (scribing) the surface, and then physically breaking. 
The physical breakage of the wafer occurs along the natural cleavage 
planes, which in the case of silicon are the {111} planes. 


Gallium arsenide 


The zinc blende lattice observed for gallium arsenide results in additional 
considerations over that of silicon. Although the {100} plane of GaAs is 
structurally similar to that of silicon, two possibilities exist: a face 
consisting of either all gallium atoms or all arsenic atoms. In either case the 
surface atoms have two dangling bonds, and the properties of the face are 
independent of whether the face is gallium or arsenic. 


The {111} plane also has the possibility of consisting of all gallium or all 
arsenic. However, unlike the {100} planes there is a significant difference 
between the two possibilities. [link] shows the gallium arsenide structure 
represented by two interpenetrating fcc lattices. The [111] axis is vertical 
within the plane of the page. Although the structure consists of alternate 
layers of gallium and arsenic stacked along the [111] axis, the distance 
between the successive layers alternates between large and small. Assigning 
arsenic as the parent lattice the order of the layers in the [111] direction is 


As— Ga-As— Ga-As~— Ga, while in the | 111 | direction the layers are 


ordered, Ga-As-Ga— As-Ga— As ({link]). In silicon these two directions are 
of course identical. The surface of a crystal would be either arsenic, with 
three dangling bonds, or gallium, with one dangling bond. Clearly, the latter 
is energetically more favorable. Thus, the (111) plane shown in [link] is 


called the (111) Ga face. Conversely, the fii plane would be either 


gallium, with three dangling bonds, or arsenic, with one dangling bond. 


Again, the latter is energetically more favorable and the fii plane is 


therefore called the (111) As face. 


The (111) Ga face of 
GaAs showing a surface 
layer containing gallium 

atoms (green) with one 
dangling bond per 
gallium and three bonds 
to the arsenic atoms (red) 
in the lower layer. 


The (111) As is distinct from that of (111) Ga due to the difference in the 
number of electrons at the surface. As a consequence, the (111) As face 
etches more rapidly than the (111) Ga face. In addition, surface evaporation 
below 770 °C occurs more rapidly at the (111) As face. 


Bibliography 


e M. Baublitz and A. L. Ruoff, J. Appl. Phys., 1982, 53, 6179. 

J. C. Jamieson, Science, 1963, 139, 845. 

C. C. Landry, J. Lockwood, and A. R. Barron, Chem. Mater., 1995, 7, 
699. 

e M. Robbins, J. C. Phillips, and V. G. Lambrecht, J. Phys. Chem. 
Solids, 1973, 34, 1205. 

D. Sridevi and K. V. Reddy, Mat. Res. Bull., 1985, 20, 929. 

Y. K. Vohra, S. T. Weir, and A. L. Ruoff, Phys. Rev. B, 1985, 31, 7344. 
e W. M. Yin and R. J. Paff, J. Appl. Phys., 1973, 45, 1456. 


Introduction to Bipolar Transistors 


Note:This module is adapted from the Connexions module entitled 
Introduction to Bipolar Transistors by Bill Wilson. 


Let's leave the world of two terminal devices (which are all called diodes by 
the way; diode just means two-terminals) and venture into the much more 
interesting world of three terminals. The first device we will look at is 
called the bipolar transistor. Consider the structure shown in [link]: 


+ 
Emitter | Base | Collector oe 
n+ p n 


Structure of a npn bipolar 
transistor. 


The device consists of three layers of silicon, a heavily doped n-type layer 
called the emitter, a moderately doped p-type layer called the base, and 
third, more lightly doped layer called the collector. In a biasing (applied DC 
potential) configuration called forward active biasing, the emitter-base 
junction is forward biased, and the base-collector junction is reverse biased. 
[link] shows the biasing conventions we will use. Both bias voltages are 
referenced to the base terminal. Since the base-emitter junction is forward 
biased, and since the base is made of p-type material, Vp must be negative. 
On the other hand, in order to reverse bias the base-collector junction Vcp 
will be a positive voltage. 


Forward active biasing of a npn 
bipolar transistor. 


Now, let's draw the band-diagram for this device. At first this might seem 
hard to do, but we know what forward and reverse biased band diagrams 
look like, so we'll just stick one of each together. We show this in [link], 
which is a very busy figure, but it is also very important, because it shows 
all of the important features in the operation the transistor. Since the base- 
emitter junction is forward biased, electrons will go from the (n-type) 
emitter into the base. Likewise, some holes from the base will be injected 
into the emitter. 


‘Ee 


Band diagram and carrier fluxes 
in a bipolar transistor. 


In [link], we have two different kinds of arrows. The open arrows which are 
attached to the carriers, show us which way the carrier is moving. The solid 
arrows which are labeled with some kind of subscripted J, represent current 
flow. We need to do this because for holes, motion and current flow are in 
the same direction, while for electrons, carrier motion and current flow are 
in opposite directions. 


Just as we saw in the last chapter, the electrons which are injected into the 
base diffuse away from the emitter-base junction towards the (reverse 
biased) base-collector junction. As they move through the base, some of the 
electrons encounter holes and recombine with them. Those electrons which 
do get to the base-collector junction run into a large electric field which 
sweeps them out of the base and into the collector. They "fall" down the 
large potential drop at the junction. 


These effects are all seen in [link], with arrows representing the various 
currents which are associated with each of the carriers fluxes. I, represents 
the current associated with the electron injection into the base, i.e., it points 
in the opposite direction from the motion of the electrons, since electrons 
have a negative charge. Iz, represents the current associated with holes 
injection into the emitter from the base. Ip, represents recombination 
current in the base, while Jc, represents the electron current going into the 
collector. It should be easy for you to see that: 


Equation: 
Ip = Ige + Len 
Equation: 
Ip = Ign + Lpr 
Equation: 
Ic = Ice 


In a "good" transistor, almost all of the current across the base-emitter 
junction consists of electrons being injected into the base. The transistor 
engineer works hard to design the device so that very little emitter current is 


made up of holes coming from the base into the emitter. The transistor is 
also designed so that almost all of those electrons which are injected into 
the base make it across to the base-collector reverse-biased junction. Some 
recombination is unavoidable, but things are arranged so as to minimize this 
effect. 


Basic MOS Structure 


Note:This module is adapted from the Connexions module entitled Basic 
MOS Structure by Bill Wilson. 


[link] shows the basic steps necessary to make the MOS structure. It will 
help us in our understanding if we now rotate our picture so that it is 
pointing sideways in our next few drawings. [link] shows the rotated 
structure. Note that in the p-silicon we have positively charged mobile 
holes, and negatively charged, fixed acceptors. Because we will need it 
later, we have also shown the band diagram for the semiconductor below 
the sketch of the device. Note that since the substrate is p-type, the Fermi 


level is located down close to the valance band. 
polysilicon 


SiO» Op + heat ~ SiH4 + heat 
—— PELL EL ELIE ELD 


Formation of the metal-oxide- 
semiconductor (MOS) structure. 


Basic metal-oxide- 
semiconductor 
(MOS) structure. 


Let us now place a potential between the gate and the silicon substrate. 
Suppose we make the gate negative with respect to the substrate. Since the 
substrate is p-type, it has a lot of mobile, positively charged holes in it. 
Some of them will be attracted to the negative charge on the gate, and move 
over to the surface of the substrate. This is also reflected in the band 
diagram shown in [link]. Remember that the density of holes is 
exponentially proportional to how close the Fermi level is to the valence 
band edge. We see that the band diagram has been bent up slightly near the 
surface to reflect the extra holes which have accumulated there. 


Applying a 
negative gate 
voltage to a basic 
metal-oxide- 
semiconductor 
(MOS) structure. 


An electric field will develop between the positive holes and the negative 
gate charge. Note that the gate and the substrate form a kind of parallel 
plate capacitor, with the oxide acting as the insulating layer in-between 
them. The oxide is quite thin compared to the area of the device, and so it is 
quite appropriate to assume that the electric field inside the oxide is a 
uniform one. (We will ignore fringing at the edges.) The integral of the 
electric field is just the applied gate voltage V,. If the oxide has a thickness 
Xox then since E,, is uniform, it is given by, [link]. 

Equation: 


Vo 


Lox 


Fox = 


If we focus in on a small part of the gate, we can make a little "pill" box 
which extends from somewhere in the oxide, across the oxide/gate interface 
and ends up inside the gate material someplace. The pill-box will have an 
area As. Now we will invoke Gauss' law which we reviewed earlier. Gauss' 
law simply says that the surface integral over a closed surface of the 
displacement vector D (which is, of course, € x E) is equal to the total 
charge enclosed by that surface. We will assume that there is a surface 
charge density -Q, Coulombs/cm? on the surface of the gate electrode 
({link]). The integral form of Gauss' Law is just: 

Equation: 


f exk dS= Qencl 


surface charge 
density Qg 


Electric Field 


‘pill box" 
with area 4s 


Finding the surface 
charge density. 


Note that we have used €,,F in place of D. In this particular set-up the 
integral is easy to perform, since the electric field is uniform, and only 
pointing in through one surface - it terminates on the negative surface 
charge inside the pill-box. The charge enclosed in the pill box is just - 
(QgAs), and so we have (keeping in mind that the surface integral of a 
vector pointing into the surface is negative), [link], or [link]. 
Equation: 


fey E AS = —(€oxEoxA(s)) 
— (Q,A(s)) 
Equation: 
EoxEox = Qs 


Now, we can use [link] to get [link] or [link]. 


Equation: 
EoxVg Q 
Lox 
Equation: 
Qs Eox __ 
Tr = “ox 
Vy Box 


The quantity c,, is called the oxide capacitance. It has units of Farads/cm?, 
so it is really a capacitance per unit area of the oxide. The dielectric 
constant of silicon dioxide, €,,, is about 3.3 x 10°!8 F/cm. A typical oxide 
thickness might be 250 A (or 2.5 x 10° cm). In this case, c,, would be 
about 1.30 x 10°’ F/cm?. The units we are using here, while they might 


seem a little arbitrary and confusing, are the ones most commonly used in 
the semiconductor business. 


The most useful form of [link] is when it is turned around, [link], as it gives 
us a way to find the charge on the gate in terms of the gate potential. We 
will use this equation later in our development of how the MOS transistor 
really works. 

Equation: 


Qs = Con 


It turns out we have not done anything very useful by apply a negative 
voltage to the gate. We have drawn more holes there in what is called an 
accumulation layer, but that is not helping us in our effort to create a layer 
of electrons in the MOSFET which could electrically connect the two n- 
regions together. 


Let's turn the battery around and apply a positive voltage to the gate 
([link]). Actually, let's take the battery out for now, and just let V, be a 
positive value, relative to the substrate which will tie to ground. Making V, 
positive puts positive Q, on the gate. The positive charge pushes the holes 
away from the region under the gate and uncovers some of the negatively- 
charged fixed acceptors. Now the electric field points the other way, and 
goes from the positive gate charge, terminating on the negative acceptor 
charge within the silicon. 


SR WAWS 
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Increasing the voltage 

extends the depletion 

region further into the 
device. 


The electric field now extends into the semiconductor. We know from our 
experience with the p-n junction that when there is an electric field, there is 
a shift in potential, which is represented in the band diagram by bending the 
bands. Bending the bands down (as we should moving towards positive 
charge) causes the valence band to pull away from the Fermi level near the 
surface of the semiconductor. If you remember the expression we had for 
the density of holes in terms of E, and E; it is easy to see that indeed, [link], 
there is a depletion region (region with almost no holes) near the region 
under the gate. (Once Fr - E, gets large with respect to kT, the negative 
exponent causes p — 0.) 

Equation: 


> 


s 


Le; 


ON 


>, 


SS 


SS 


SS 
uv 


2 


WU 


iA 


SS 


PFs 


Threshold, E; is getting 
close to E,. 


The electric field extends further into the semiconductor, as more negative 
charge is uncovered and the bands bend further down. But now we have to 
recall the electron density equation, which tells us how many electrons we 
have: 

Equation: 


Ec—Ef 
n= N.e~ kT 


A glance at [link] reveals that with this much band bending, E, the 
conduction band edge, and E; the Fermi level are starting to get close to one 
another (at least compared to kT), which means that n, the electron 
concentration, should soon start to become significant. In the situation 
represented by [link], we say we are at threshold, and the gate voltage at 
this point is called the threshold voltage, Vr. 


Now, let's increase V, above V7. Here's the sketch in [link]. Even though we 
have increased Vg beyond the threshold voltage, V;, and more positive 
charge appears on the gate, the depletion region no longer moves back into 
the substrate. Instead electrons start to appear under the gate region, and the 
additional electric field lines terminate on these new electrons, instead of on 
additional acceptors. We have created an inversion layer of electrons under 
the gate, and it is this layer of electrons which we can use to connect the 


two n-type regions in our initial device. 


J us 


Inversion - electrons form 
under the gate. 


Where did these electrons come from? We do not have any donors in this 
material, so they can not come from there. The only place from which 
electrons could be found would be through thermal generation. Remember, 
in a semiconductor, there are always a few electron hole pairs being 
generated by thermal excitation at any given time. Electrons that get created 
in the depletion region are caught by the electric field and are swept over to 
the edge by the gate. I have tried to suggest this with the electron generation 
event shown in the band diagram in the figure. In a real MOS device, we 
have the two n-regions, and it is easy for electrons from one or both to "fall" 
into the potential well under the gate, and create the inversion layer of 
electrons. 


Introduction to the MOS Transistor and MOSFETs 


Note:This module is adapted from the Connexions modules entitled 
Introduction to MOSFETs and MOS Transistor by Bill Wilson. 


We now move on to another three terminal device - also called a transistor. 
This transistor, however, works on much different principles than does the 
bipolar junction transistor of the last chapter. We will now focus on a device 
called the field effect transistor, or metal-oxide-semiconductor field effect 
transistor or simply MOSFET. 


In [link] we have a block of silicon, doped p-type. Into it we have made two 
regions which are doped n-type. To each of those n-type regions we attach a 
wire, and connect a battery between them. If we try to get some current, J, 
to flow through this structure, nothing will happen, because the n-p junction 
on the RHS is reverse biased, i.e., the positive lead from the battery going 
to the n-side of the p-n junction. If we attempt to remedy this by turning the 
battery around, we will now have the LHS junction reverse biased, and 
again, no current will flow. If, for whatever reason, we want current to flow, 
we will need to come up with some way of forming a layer of n-type 
material between one n-region and the other. This will then connect them 
together, and we can run current in one terminal and out the other. 


p-type silicon 


The start of a field effect 


transistor. 


To see how we will do this, let's do two things. First we will grow a layer of 
SiO, (silicon dioxide or silica, but actually refered to as "oxide") on top of 
the silicon. To do this the wafer is placed in an oven under an oxygen 
atmosphere, and heated to 1100 °C. The result is a nice, high-quality 
insulating SiO> layer on top of the silicon). On top of the oxide layer we 
then deposit a conductor, which we call the gate. In the "old days" the gate 
would have been a layer of aluminum; hence the "metal-oxide-silicon" or 
MOS name. Today, it is much more likely that a heavily doped layer of 
polycrystalline silicon (polysilicon, or more often just "poly") would be 
deposited to form the gate structure. Polysilicon is made from the reduction 
of a gas, such as silane (SiH,), [link]. 

Equation: 


The silicon is polycrystalline (composed of lots of small silicon crystallites) 
because it is deposited on top of the oxide, which is amorphous, and so it 
does not provide a single crystal "matrix" which would allow the silicon to 
organize itself into one single crystal. If we had deposited the silicon on top 
of a single crystal silicon wafer, we would have formed a single crystal 
layer of silicon called an epitaxial layer. This is sometimes done to make 
structures for particular applications. For instance, growing a n-type 
epitaxial layer on top of a p-type substrate permits the fabrication of a very 
abrupt p-n junction. 


Note:Epitaxy, is a transliteration of two Greek words epi, meaning "upon", 
and taxis. meaning "ordered". Thus an epitaxial layer is one that follows 
the order of the substrate on which it is grown. 


Now we can go back now to our initial structure, shown in [link], only this 
time we will add an oxide layer, a gate structure, and another battery so that 
we can invert the region under the gate and connect the two n-regions 
together. Well also identify some names for parts of the structure, so we will 
know what we are talking about. For reasons which will be clear later, we 
call the n-region connected to the negative side of the battery the source, 
and the other one the drain. We will ground the source, and also the p-type 
substrate. We add two batteries, V;, between the gate and the source, and 
Vas between the drain and the source. 
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p-type substrate 


Biasing a MOSFET transistor 


It will be helpful if we also make another sketch, which gives us a 
perspective view of the device. For this we strip off the gate and oxide, but 
we will imagine that we have applied a voltage greater than Vr to the gate, 
so there is a n-type region, called the channel which connects the two. We 
will assume that the channel region is Z long and W wide, as shown in 
[link]. 


BU dx 


The inversion channel and its 
resistance. 


Next we want to take a look at a little section of channel, and find its 
resistance a(R), when the little section is O(a) long, [link]. 
Equation: 


We have introduced a slightly different form for our resistance formula 
here. Normally, we would have a simple o in the denominator, and an area 
A, for the cross-sectional area of the channel. It turns out to be very hard to 
figure out what that cross sectional area of the channel is however. The 
electrons which form the inversion layer crowd into a very thin sheet of 
surface charge which really has little or no thickness, or penetration into the 
substrate. 


If, on the other hand we consider a surface conductivity (units: simply 
mhos), o, [link], then we will have an expression which we can evaluate. 
Here, jz; is a surface mobility, with units of cm2/V.sec, that is the quantity 
which represented the proportionality between the average carrier velocity 
and the electric field, [link] and [link]. 


Equation: 
Os = HsQ chan 
Equation: 
v=pE 
Equation: 
qT 
an 


The surface mobility is a quantity which has to be measured for a given 
system, and is usually just a number which is given to you. Something 
around 300 cm?/V.sec is about right for silicon. Q chan is called the surface 
charge density or channel charge density and it has units of Coulombs/cm?. 
This is like a sheet of charge, which is different from the bulk charge 
density, which has units of Coulombs/cm2. Note that: 


Equation: 
2 Coul 
cm Coulombs __ sec 
Volt sec cm? “Volt 
ees 
—~ VY 
= mbhos 


It turns out that it is pretty simple to get an expression for Q chan, the surface 
charge density in the channel. For any given gate voltage V,,, we know that 
the charge density on the gate is given simply as: 

Equation: 


Qo = Cox Ves 


However, until the gate voltage V,., gets larger than Vr we are not creating 
any mobile electrons under the gate, we are just building up a depletion 
region. We'll define Q as the charge on the gate necessary to get to 
threshold. Q-7 = CoxVr. Any charge added to the gate above Q7 is 
matched by charge @ chan in the channel. Thus, it is easy to say: [link] or 
[link]. 

Equation: 


OQ) saan = Q, _ Qr 


Equation: 


Q chan = Cox (V, _ Vr) 


Thus, putting [link] and [link] into [link], we get: 
Equation: 


_ al(x) 
7 LsCox (Vas = Vr)W 


If you look back at [link], you will see that we have defined a current Ig 
flowing into the drain. That current flows through the channel, and hence 
through our little incremental resistance a(R), creating a voltage drop 
a(V.) across it, where V; is the channel voltage, [link]. 

Equation: 


al(V.(«)) 


Iydl(R) 
Izdl(x) 
[sCox(Ves—Vr)W 


Let's move the denominator to the left, and integrate. We want to do our 
integral completely along the channel. The voltage on the channel V,(z) 
goes from 0 on the left to Vg, on the right. At the same time, z is going 
from 0 to L. Thus our limits of integration will be 0 and Vg, for the voltage 
integral Q(V.(x)) and from 0 to L for the z integral a/(z). 

Equation: 


Vas LT 
i [sCox (Ves - Vr Wd Ve = / Igdz 
0 0 


Both integrals are pretty trivial. Let's swap the equation order, since we 
usually want J/g as a function of applied voltages. 
Equation: 


IgL = [LsCoxW (Vas ad Vr) Vas 


We now simply divide both sides by Z, and we end up with an expression 
for the drain current Jg, in terms of the drain-source voltage, Vg;, the gate 


voltage V,, and some physical attributes of the MOS transistor. 
Equation: 


sCoxW 
i= ( eet Vee v1) Vai 


Light Emitting Diode 
Light Emitting Diode 


Note:This module is adapted from the Connexions module entitled Light 
Emitting Diode by Bill Wilson. 


Let's talk about the recombining electrons for a minute. When the electron 
falls down from the conduction band and fills in a hole in the valence band, 
there is an obvious loss of energy. The question is; where does that energy 
go? In silicon, the answer is not very interesting. Silicon is what is known 
as an indirect band-gap material. What this means is that as an electron 
goes from the bottom of the conduction band to the top of the valence band, 
it must also undergo a significant change in momentum. This all comes 
about from the details of the band structure for the material, which we will 
not concern ourselves with here. As we all know, whenever something 
changes state, we must still conserve not only energy, but also momentum. 
In the case of an electron going from the conduction band to the valence 
band in silicon, both of these things can only be conserved if the transition 
also creates a quantized set of lattice vibrations, called phonons, or "heat". 
Phonons posses both energy and momentum, and their creation upon the 
recombination of an electron and hole allows for complete conservation of 
both energy and momentum. All of the energy which the electron gives up 
in going from the conduction band to the valence band (1.1 eV) ends up in 
phonons, which is another way of saying that the electron heats up the 
crystal. 


In some other semiconductors, something else occurs. In a class of 
materials called direct band-gap semiconductors, the transition from 
conduction band to valence band involves essentially no change in 
momentum. Photons, it turns out, possess a fair amount of energy (several 
eV/photon in some cases) but they have very little momentum associated 
with them. Thus, for a direct band gap material, the excess energy of the 
electron-hole recombination can either be taken away as heat, or more 
likely, as a photon of light. This radiative transition then conserves energy 


and momentum by giving off light whenever an electron and hole 
recombine. This gives rise to the light emitting diode (LED). Emission of a 
photon in an LED is shown schematically in [link]. 
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Radiative recombination in a 
direct band-gap semiconductor. 


It was Planck who postulated that the energy of a photon was related to its 
frequency by a constant, which was later named after him. If the frequency 
of oscillation is given by the Greek letter "nu" (v), then the energy of the 
photon is just given by, [link], where h is Planck's constant, which has a 
value of 4.14 x 10° eV.sec. 

Equation: 


E=hv 


When we talk about light it is conventional to specify its wavelength, A, 
instead of its frequency. Visible light has a wavelength on the order of 
nanometers, e.g., red is about 600 nm, green about 500 nm and blue is in 
the 450 nm region. A handy "rule of thumb" can be derived from the fact 
that c = Av, where c is the speed of light (3 x 10° m/sec or 3 x 10!” nm/sec, 
[link]. 

Equation: 


A(nm) = ev) 


1242 
E(eV) 


Thus, a semiconductor with a 2 eV band-gap should give off light at about 
620 nm (in the red). A 3 eV band-gap material would emit at 414 nm, in the 
violet. The human eye, of course, is not equally responsive to all colors 
({link]). The materials which are used for important light emitting diodes 
(LEDs) for each of the different spectral regions are also shown in [link]. 
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Relative response of the human 
eye to various colors. 


It is worth noting that a number of the important LEDs are based on the 
GaAsP system. GaAs is a direct band-gap semiconductor with a band gap 
of 1.42 eV (in the infrared). GaP is an indirect band-gap material with a 
band gap of 2.26 eV (550 nm, or green). Both As and P are group V 
elements. (Hence the nomenclature of the materials as III-V (or 13-15) 
compound semiconductors.) We can replace some of the As with P in GaAs 
and make a mixed compound semiconductor GaAsj_,P,. When the mole 
fraction of phosphorous is less than about 0.45 the band gap is direct, and 
so we can "engineer" the desired color of LED that we want by simply 
growing a crystal with the proper phosphorus concentration! The properties 
of the GaAsP system are shown in [link]. It turns out that for this system, 
there are actually two different band gaps, as shown in [link]. One is a 
direct gap (no change in momentum) and the other is indirect. In GaAs, the 
direct gap has lower energy than the indirect one (like in the inset) and so 
the transition is a radiative one. As we start adding phosphorous to the 
system, both the direct and indirect band gaps increase in energy. However, 
the direct gap energy increases faster with phosphorous fraction than does 


the indirect one. At a mole fraction x of about 0.45, the gap energies cross 
over and the material goes from being a direct gap semiconductor to an 
indirect gap semiconductor. At x = 0.35 the band gap is about 1.97 eV (630 
nm), and so we would only expect to get light up to the red using the 
GaAsP system for making LED's. Fortunately, people discovered that you 
could add an impurity (nitrogen) to the GaAsP system, which introduced a 
new level in the system. An electron could go from the indirect conduction 
band (for a mixture with a mole fraction greater than 0.45) to the nitrogen 
site, changing its momentum, but not its energy. It could then make a direct 
transition to the valence band, and light with colors all the way to the green 
became possible. The use of a nitrogen recombination center is depicted in 
the [link]. 
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If we want colors with wavelengths shorter than the green, we must 
abandon the GaAsP system and look for more suitable materials. A 
compound semiconductor made from the II-VI elements Zn and Se make up 
one promising system, and several research groups have successfully made 
blue and blue-green LEDs from ZnSe. SiC is another (weak) blue emitter 
which is commercially available on the market. Recently, workers at a tiny, 
unknown chemical company stunned the "display world" by announcing 
that they had successfully fabricated a blue LED using the II-V material 
GaN. A good blue LED was the "holy grail" of the display and CD ROM 
research community for a number of years. Obviously, adding blue to the 
already working green and red LED's completes the set of 3 primary colors 
necessary for a full-color flat panel display. Furthermore, using a blue LED 
or laser in a CD ROM would more than quadruple its data capacity, as bit 
diameter scales as A, and hence the area as A2. 


Polymer Light Emitting Diodes 


This module was developed as part of a Rice University course CHEM496: 
Chemistry of Electronic Materials. This module was prepared with the assistance 
of Pui Yee Hung. 


Introduction 


In 1990, electroluminescent (EL) from conjugated polymers was first reported by 
Burroughes et al. of Cambridge University. A layer of poly(para- 
phenylenevinylene) (PPV) was sandwiched between layers of indium tin oxide 
(ITO) and aluminum. When this device is under a 14 V dc bias, the PPV emits a 
yellowish-green light with a quantum efficiency of 0.05%. This report attracted a 
lot of attention, because the potential that polymer light emitting diodes (LEDs) 
could be inexpensively mass produced into large area display area. The processing 
steps in making polymer LEDs are readily scaleable. The industrial coating 
techniques is well developed to mass produce polymer layers of 100 nm 
thickness, and the device could be patterned onto large surface area by pixellation 
of metal. 


Since the initial discovery, and increasing amount of researches has been 
performed, and significant progress has been made. In 1990 the polymer LED 
only emitted yellowish green color, now the emission color ranged from deep blue 
to near infra red. The efficiency of the multi-layer polymer LED even reached a 
quantum efficiency of >4% and the operating voltage has been reduced 
significantly. In term of efficiency, color selection and operating voltage, polymer 
LEDs have attained adequate levels for commercialization. But there are 
reliability problems that are symptomatic of any organic devices. 


Device physics and materials science of polymer LEDs 


A schematic diagram of a polymer LED is shown in [link]. A polymer LED can 
be divided into three different components: 


A. Anode: the hole supplier, made of metal of high working function. Examples 
of the common anode are indium tin oxide (ITO), gold etc. The anode is 
usually transparent so that light can be emitted through. 

B. Cathode: the electron supplier, made of metal of low working function. 
Examples of the common cathode are aluminum or calcium. 

C. Polymer: made of conjugated polymer film with thickness of 100 nm. 


Cathode (aluminum) 


Anode (ITO) 


Substrate 
(glass) 


Emitted light 


Schematic set-up of polymer LED. 


When a polymer LED is under a direct current (dc) bias, holes are injected from 
the anode (ITO) and electrons are injected from the cathode (aluminum). Under 
the influences of the electrical field, the electrons and holes will migrate toward 
each other. When they recombine in the conjugated polymer layer, a bound 
excited states (excitons) will be formed. Some of the excitons (singlets) then 
decays in the conjugated polymer layer to emit light through the transparent 
substrates (glass). The emission color will be depended on the energy gap of the 
polymers. There is energy gap in a conjugated polymer because the m electron are 
not completely delocalized over the entire polymer chain. Instead there are 
alternate region in the polymer chain that has a higher electron density ({link]a). 
The chain length of this region is about 15-20 multiple bonds. The emission color 
can be controlled by tuning this energy band gap ((link]|b). It shows that bond 
alternation limits the extent of delocalization. [link] summarizes the structure and 
emission color of some common conjugated polymers. 


(a) 


Alternation of bond lengths along a conjugated polymer 
chain (a) results in a material with properties of a large 
band gap semiconductor (b), where CB is the conductive 
band gap, and VB is the valence band, and E, is the band 
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Example of common conjugated polymers. 


Approaches to improve the efficiency 
Efficiency for any LED is defined: 


Next = Desc ~ int 


where Nexis the external quantum efficiency, nj; is the internal efficiency 
(represents the fraction of injected carrier, usually electron, that is converted to 
photon), and n,,, is the escape efficiency (represent fraction of photons that can 
reach to the outside). 


The most common way to improve the internal efficiency is to balance the 
number of electrons and holes which arrives at the polymer layer. Originally, there 
are more holes than electron that arrive of the polymer layer because conjugated 
polymers have a higher electron affinity, and as a consequence will favor the 
transport of hole than electron. There are two ways to maintains the balance: 


1. Match the work function of electrode with the electron affinity and ionization 
potential of the polymer. 

2. Tune the polymer’s electron affinity and ionization potential to match the 
work function of the electrode. 


The escape efficiency is also important because a polymer LED is made up of 
layers of materials that have different refractive index, and some of the photon 
generated from the excition may be reflected at the boundary and trapped inside 
the device. 


Improvement in internal quantum efficiency using low working function 
cathode 


Conjugated polymer is electron rich, the mobility for hole is higher than electron, 
and more holes will arrive in the polymer layer than electrons. One way to 
increase the population of the electron is to use a lower working function metal as 
cathode. Braun and Heeger have replaced the aluminum cathode with calcium 
results in improved internal efficiency by a factor of ten, to 0.1%. This approach 
is direct and fast but low working function electrode like calcium will be oxidized 
easily and shorten the devices’ life. 


Improvement in internal quantum efficiency using multiple polymer layers 


A layer of poly[2,5-di(hexyloxy)cyanoterephthalylidene] (CN-PPYV, [link]) is 
coated on top of PPV to improve the transport and recombination of electron and 
holes ({link]). 
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Structure of CN-PPV. 
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Schematic representation of a CN-PPV and PPV 
multi-layer LED. 


The nitrile group in the CN-PPV has two effect on the polymer. 


1. It increases the electron affinity so electrons can travel more efficient from 
the aluminum to the polymer layer. And metal of relative high working 
function like aluminum and gold can be now be used as cathode instead of 
calcium. 

2. It increases the binding energy of the occupied m and unoccupied m* state but 
maintain a similar m-1* gap. So when the PPV and CN-PPV is placed 


together, holes and electron will be confined at the heterojunction. 


The resulting energy levels are shown in [link]. 


PPV CN-PPV 


Schematic energy-level diagram for a PPV and 
CN-PPV under foreword bias. Adapted from N. C. 
Greenham, S. C. Maratti, D. D. C. Bradley, R. H. 
Friend, and A. B. Holmes, Nature,1993, 365, 62. 


The absolute energies of levels are not known accurately, but the diagram show 
the relative position of the HOMO and LUMO levels in the polymers, and the 
Fermi levels of the various possible metal contacts, the differences in electron 
affinity (AEA) and ionization potential (AIP) between PPV and CN-PPV are also 
shown ([link]). 


At the polymers interface there is a sizable offset in the energies of HOMO and 
LUMO of PPV and CN-PPY, the holes transported from the ITO and the electrons 
transport from the aluminum will be confined in the heterojunction. The local 
charge density will be sufficiently high to ensure the holes and electrons will pass 
within a collision capture radius. This set-up increases the chance for an electrons 
to combine with holes to form an excition. In addition, the emission will be close 
to the junction, far away from the electrode junction which will quench the singlet 
excitions. The result is that a multi-layers LED has an internal quantum efficiency 


of 10% and external quantum efficiency (for light emitted in foreword direction) 
of 25%. 


Based on this approach, a couple of polymers have been developed or modified to 
produce the desirable emission color and processing property. The drawback of 
this method is that desirable properties may not be commentary to each other. For 
example, in MEH-PPV an alkoxy side group (RO) is introduced to PPV so that it 
can be dissolved in organic solvent. But the undesirable effect is that MEH-PPV is 
less thermally stable. Moreover in multiple layers LEDs, different polymer layers 
have different refractive indices and a fraction of the photons will undergo total 
internal reflection at the refractive boundaries and cannot escape as light. This 
problem can be overcome by Febry-Pert microcavity structure. 


Improvement in external quantum efficiency using microcavity 


Fabry-Perot resonant structures are also used in inorganic LED, and are is based 
on Fermi’s golden rule: 


K,~ | <M > | tw 


where M (the matrix element of the perturbation between final and initial states) 
depends on the nature of the material, and r;,) can be altered by changing the 
density of various density states, e.g. using a luminescent thin films to select 
certain value of V. 


In building a microcavity for a polymer LED, the polymer is placed between two 
mirrors. ((link]), in which one of the mirrors is made up of aluminum, the other 
mirror (a Bragg Mirror) is form by epitaxial multilayer stacks of Si,Ny and SiOp. 


Aluminum mirror 


LEKI RAL AL 
REEL Polymer 
AN 


Si,N, 
SiO, 


Schematic set-up of micro-cavity. 


Improvement in internal quantum efficiency: doping of polymer 


Doping is a process that creates carrier by purposely introducing impurities and is 
very popular method in the semiconductor industry. However, this technique was 
not used in polymer LED until 1995, when a co-polymer polystyrene-poly(3- 
hexylthiphene) (PS-P3HT) was doped with FeCl; Doping of MEH-PPV with 
iodine has improved the efficiency by 200% and the polymer LED can be 
operated under both forward and reverse bias ([link]). The doping is accomplsihed 
by mixing 1 wt% MEH-PPV with 0.2 wt% Ip. The molar ratio of MEH-PPV to Ip 
is 5:1. That is a huge “doping “ ratio when you compare the doping concentration 
in the semiconductor. 


Un-doped Doped 
Turn on voltage (V) 10 foreword 5, reversed 12 


External efficiency (%) 4x 104 8x 10° 


Results of iodine doping of an Al/MEH-PPV/ITO-based LED. 


Polymer LEDs on a silicon substrate: an application advantage over 
inorganic LEDs 


In the initial research polymer LEDs were in direct competition with the inorganic 
LEDs and tried to achieve the existing LED standard. This is a difficult task as 
polymer LEDs have a lower long term stability. However, there are some 
applications in which polymer LEDs have a clear advantage over their more 
traditional inorganic analogs. One of these is to incorporate LEDs with the silicon 
integrated circuits for inter-chip communication. 


It is difficult to build inorganic LEDs on a silicon substrate, because of the 
thermal stress developing between the inorganic LED (usually a HI-V based 
device) and the silicon interface. But polymer LEDs offer a solution, since 
polymers can be easily spin-coated on the silicon. The operating voltage of 
polymer LED is less than 4 V, and the turn on voltage can be as low as 2 V. 
Together with a switching time of less than 50 ns, make polymer LED a perfect 
candidate. 


Reliability and degradation of polymer LEDs 


In terms of the efficiency, color selection, and driving voltage, polymer LED have 
attained adequate level for commercialization. However, the device lifetime is still 
far from satisfactory. Research into understanding the reliability and degredation 
mechanisms of polymer LEDs has generally been divided into two area: 


1. Photo-degradation of polymer. 
2. Interface degradation. 


Polymer photo degradation 


Photoluminescece (PL) studies of the photo-oxidation of PPV have been 
undertaken, since it is believes that EL is closely related with PL. 


It was found that there is a rapid decay in emission when PPV is exposed to 
oxygen. Using time resolved FTIR spectroscopy an increase in the carbonyl signal 
and a decrease in C=C signal with time ((link]). It was suggested that the carbonyl 


group has a strong electron affinity level to charge transfer between molecules 
segment in the polymer, thereby dissociating the excition and quenching the PL. 
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FTIR as a function of photo-oxidation of 
PPV. Adapted from M. Yan, L. J. Rothberg, 
F, Papadimitrakopoulos, M. E. Galvin and T. 

M. Miller, Phys. Rev. Lett., 1994, 73, 744. 


Similar research was performed by Cumpston and Jensen using BCHA-PPV and 
P30T ([link]) and exposing them to dry air in UV irradiation. In BCHA-PPV, 
there is an increase in carbonyl signal with time, while the P3OT remain intact. A 
mechanism proposed for the degradation of BCHA-PPV involves the transfer of 
energy from the excited triplet state of the PPV to oxygen to from singlet oxygen 
which attack the vinyl double bond in the PPV backbone. And P3O0T dose not has 
vinyl bond so it can resist the oxidation . 


(a) (b) 


Structure of (a) BCHA-PPV and (b) P30T. 


The research described above was all performed on polymer thin films deposited 
on an inert surface. The presence of cathode and anode may also affect the 
oxidation mechanism. Scott et al. have taken IR spectra from a MEH-PPV LED in 
the absence of oxygen. They obtained similar result as in Yan et al., however, a 
decrease in ITO’s oxygen signal was noticed suggesting that the ITO anode acts 
like a oxygen reservoir and supplies the oxygen for the degradation process. 


Polymer LED interface degradation 


There are few interface degredation studies in polymer LEDs. One of them by 
Scott et al. took SEM image of the cathode from a failed polymer LED. The 
polymer LED used ITO as the anode, MEH-PPV as the polymer layer, and an 
aluminum calcium alloy as cathode. SEM images showed “craters” formed in the 
cathode. The craters are formed when the cathode metal is melted and pull away 
from the polymer layer. It was suggested that a high current density will generate 
heat and result in local hot spot. The temperature in the hot spot is high enough to 
melt the cathode. And when it melt, it will pull away from the polymer. This 
process will decrease the effective cathode area, and reduce the luminescence 
gradually. 
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Laser 
LASER 


Note:This module is adapted from the Connexions module entitled LASER 
by Bill Wilson. 


What is the difference between an LED and a solid state laser? There are 
some differences, but both devices operate on the same principle of having 
excess electrons in the conduction band of a semiconductor, and arranging 
it so that the electrons recombine with holes in a radiative fashion, giving 
off light in the process. What is different about a laser? In an LED, the 
electrons recombine in a random and unorganized manner. They give off 
light by what is known as spontaneous emission, which simply means that 
the exact time and place where a photon comes out of the device is up to 
each individual electron, and things happen in a random way. 


There is another way in which an excited electron can emit a photon 
however. If a field of light (or a set of photons) happens to be passing by an 
electron in a high energy state, that light field can induce the electron to 
emit an additional photon through a process called stimulated emission. The 
photon field stimulates the electron to emit its energy as an additional 
photon, which comes out in phase with the stimulating field. This is the big 
difference between incoherent light (what comes from an LED or a 
flashlight) and coherent light which comes from a laser. With coherent 
light, all of the electric fields associated with each phonon are all exactly in 
phase. This coherence is what enables us to keep a laser beam in tight 
focus, and to allow it to travel a large distance without much divergence or 
spreading out. 


So how do we restructure an LED so that the light is generated by 
stimulated emission rather than spontaneous emission? Firstly, we build 
what is called a heterostructure. All this means is that we build up a 
sandwich of somewhat different materials, with different characteristics. In 
this case, we put two wide band-gap regions around a region with a 


narrower band gap. The most important system where this is done is the 
AlGaAs/GaAs system. A band diagram for such a set up is shown in [link]. 
AlGaAs (pronounced "Al-Gas") has a larger band-gap then does GaAs. The 
potential "well" formed by the GaAs means that the electrons and holes will 
be confined there, and all of the recombination will occur in a very narrow 
strip. This greatly increases the chances that the carriers can interact, but we 
still need some way for the photons to behave in the proper manner. [Link] is 
a diagram of what a typical diode might look like. We have the active GaAs 
layer sandwich in-between the two heterostructure confinement layers, with 
a contact on top and bottom. On either end of the device, the crystal has 
been "cleaved" or broken along a crystal lattice plane. This results in a 
shiny "mirror-like" surface, which will reflect photons. The back surface 
(which we can not see here) is also cleaved to make a mirror surface. The 
other surfaces are purposely roughened so that they do not reflect light. 
Now let us look at the device from the side, and draw just the band diagram 
for the GaAs region ({link]). We start things off with an electron and hole 
recombining spontaneously. This emits a photon which heads towards one 
of the mirrors. As the photon goes by other electrons, however, it may cause 
one of them to decay by stimulated emission. The two (in phase) photons 
hit the mirror and are reflected and start back the other way . As they pass 
additional electrons, they stimulate them into a transition as well, and the 
optical field within the laser starts to build up. After a bit, the photons get 
down to the other end of the cavity. The cleaved facet, while it acts like a 
mirror, is not a perfect one. Some light is not reflected, but rather "leaks"; 
though, and so becomes the output beam from the laser. The details of 
finding what the ratio of reflected to transmitted light is will have to wait 
until later in the course when we talk about dielectric interfaces. The rest of 
the photons are reflected back into the cavity and continue to stimulate 
emission from the electrons which continue to enter the gain region because 
of the forward bias on the diode. 


n-AlGaAs GaAs p-AlGaAs 


The band diagram for a double 
heterostructure GaAs/AlGaAs 
laser. 


A schematic diagram of a 
typical laser diode. 


Build up of a photon 
field in a laser diode. 


In reality, the photons do not move back and forth in a big "clump" as we 
have described here, rather they are distributed uniformly along the gain 
region ([link]). The field within the cavity will build up to the point where 
the loss of energy by light leaking out of the mirrors just equals the rate at 
which energy is replaced by the recombining electrons. 
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Output coupling in a diode 
laser. 


Solar Cells 


Note:This module is adapted from the Connexions module entitled Solar 
Cells by Bill Wilson. 


Now let us look at the opposite process of light generation for a moment. 
Consider the following situation where we have just a plain old normal p-n 
junction, only now, instead of applying an external voltage, we imagine that 
the junction is being illuminated with light whose photon energy is greater 
than the band-gap ({link]a). In this situation, instead of recombination, we 
will get photo-generation of electron hole pairs. The photons simply excite 
electrons from the full states in the valence band, and "kick" them up into 
the conduction band, leaving a hole behind. This is similiar to the thermal 
excitation process. As can be seen from [link]b, this creates excess electrons 
in the conduction band in the p-side of the diode, and excess holes in the 
valence band of the n-side. These carriers can diffuse over to the junction, 
where they will be swept across by the built-in electric field in the depletion 
region. If we were to connect the two sides of the diode together with a 
wire, a current would flow through that wire as a result of the electrons and 
holes which move across the junction. 


A schematic representation of a p-n diode under 
illlumination. 


Which way would the current flow? A quick look at [link]c shows that 
holes (positive charge carriers) generated on the n-side will float up to the 
p-side as they go across the junction. Hence positive current must be 
coming out of the anode, or p-side of the junction. Likewise, electrons 
generated on the p-side will fall down the junction potential, and come out 
the n-side, but since they have negative charge, this flow represents current 
going into the cathode. We have constructed a photovoltaic diode, or solar 
cell. [link] is a picture of what this would look like schematically. We might 
like to consider the possibility of using this device as a source of energy, but 
the way we have things set up now, since the voltage across the diode is 
zero, and since power equals current times voltage, we see that we are 
getting nada from the cell. What we need, obviously, is a load resistor, so 
let's put one in. It should be clear from [link] that the photo current flowing 
through the load resistor will develop a voltage which it biases the diode in 
the forward direction, which, of course will cause current to flow back into 


the anode. This complicates things, it seems we have current coming out of 
the diode and current going into the diode all at the same time! How are we 
going to figure out what. what is going on? 
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Schematic representation 
of a photovoltaic cell. 
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Photovoltaic cell 
with a load 
resistor. 


The answer is to make a model. The current which arises due to the photon 
flux can be conveniently represented as a current source. We can leave the 
diode as a diode, and we have the circuit shown in [link]. Even though we 
show I,,; coming out of the device, we know by the usual polarity 
convention that when we define V,,; as being positive at the top, then we 
should show the current for the photovoltaic, I,y as current going into the 
top, which is what was done in [Link]. Note that Ipy = Idiode - Iphoto, $0 all we 
need to do is to subtract the two currents; we do this graphically in [link]. 


Note that we have numbered the four quadrants in the I-V plot of the total 
PV current. In quadrant I and III, the product of I and V is a positive 
number, meaning that power is being dissipated in the cell. For quadrant IT 
and IV, the product of I and V is negative, and so we are getting power from 
the device. Clearly we want to operate in quadrant IV. In fact, without the 
addition of an external battery or current source, the circuit, will only run in 
the IV'th quadrant. Consider adjusting R;,, the load resistor from 0 (a short) 
to co (an open). With R;,, we would be at point A on [link]. As Rj, starts to 
increase from zero, the voltage across both the diode and the resistor will 
start to increase also, and we will move to point B, say. As Ry, gets bigger 
and bigger, we keep moving along the curve until, at point C, where Ry, is 
an open and we have the maximum voltage across the device, but, of 


course, no current coming out! 
lout ————_» 


A model of a PV cell. 
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Combining the diode and 
the current source. 


Power is VJ so at B for instance, the power coming out would be 
represented by the area enclosed by the two dotted lines and the coordinate 
axes. Someplace about where I have point B would be where we would be 
getting the most power out of out solar cell. 


[link] shows you what a real solar cell would look like. They are usually 
made from a complete wafer of silicon, to maximize the usable area. A 
shallow (0.25 tm) junction is made on the top, and top contacts are applied 
as stripes of metal conductor as shown. An anti-reflection (AR) coating is 
applied on top of that, which accounts for the bluish color which a typical 
solar cell has ({link]). 


Solar Cell Wafer 


top contact AR coating 
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Side View 


A schematic diagram of a real 
solar cell. 


A solar cell showing the blue 
tint due to the AR coating. 


The solar power flux on the earth's surface is (conveniently) about 1 kW/m? 
or 100 mW/cm-. So if we made a solar cell from a 4 inch diameter wafer 
(typical) it would have an area of about 81cm? and so would be receiving a 
flux of about 8.1 Watts. Typical cell efficiencies run from about 10% to 
maybe 15% unless special (and costly) tricks are made. This means that we 
will get about 1.2 Watts out from a single wafer. Looking at B on 2.59 we 
could guess that Vout will be about 0.5 to 0.6 volts, thus we could expect to 
get maybe around 2.5 amps from a 4 inch wafer at 0.5 volts with 15% 
efficiency under the illumination of one sun. 


Properties of Gallium Arsenide 


Gallium: the element 


The element gallium was predicted, as eka-aluminum, by Mendeleev in 
1870, and subsequently discovered by Lecog de Boisbaudran in 1875; in 
fact de Boisbaudran had been searching for the missing element for some 
years, based on his own independent theory. The first experimental 
indication of gallium came with the observation of two new violet lines in 
the spark spectrum of a sample deposited on zinc. Within a month of these 
initial results de Boisbaudran had isolated 1 g of the metal starting from 
several hundred kilograms of crude zinc blende ore. The new element was 
named in honor of France (Latin Gallia), and the striking similarity of its 
physical and chemical properties to those predicted by Mendeleev ([link]) 
did much to establish the general acceptance of the periodic Law; indeed, 
when de Boisbaudran first stated that the density of Ga was 4.7 g/cm? rather 
than the predicted 5.9 g/cm?, Mendeleev wrote to him suggesting that he 
redetermine the value (the correct value is 5.904 g/cm?). 


Mendeleev's Observed properties of 

Property prediction (1871) for gallium (discovered 
eka-aluminum, M 1875) 

eee ca. 68 69.72 

weight 

Density, 

ee 5.9 5.904 
g.cm 
Mens Low 29.78 


point 


Vapor Non-volatile 10° mmHg, 1000 °C 
pressure 


Valence 3 3 
Oxide M,O3 GayO3 
Density 
of oxide 5.5 5.88 
(g/cm?) 
: _ shoul d dissolve Ga metal dissolves 
Properties slowly in acids and 


slowly in acids and 


of metal alkalis and be stable in Albalie end Ge Seble ack 


alr 


Properties M(OH)3 should 
of dissolve in both acids 
hydroxide and alkalis 


Ga(OH)3 dissolves in 
both acids and alkalis 


M salts will tend to Ga salts readily 

form basic salts; the hydrolyze and form basic 

sulfate should form salts; alums are known; 
Properties alums; M>S3 should be GaS3 can be precipitated 
of salts precipitated by H2S or under special conditions 

(NH,)2S; anhydrous by H2S or (NH,)2S, 

MCl3 should be more anhydrous GaCl3 is more 

volatile than ZnCl» volatile than ZnCl. 


Comparison of predicted and observed properties of gallium. 


Gallium has a beautiful silvery blue appearance; it wets glass, porcelain, 
and most other surfaces (except quartz, graphite, and Teflon®) and forms a 
brilliant mirror when painted on to glass. The atomic radius and first 
ionization potential of gallium are almost identical with those of aluminum 
and the two elements frequently resemble each other in chemical properties. 
Both are amphoteric, but gallium is less electropositive as indicated by its 


lower electrode potential. Differences in the chemistry of the two elements 
can be related to the presence of a filled set of 3d orbitals in gallium. 


Gallium is very much less abundant than aluminum and tends to occur at 
low concentrations in sulfide minerals rather than as oxides, although 
gallium is also found associated with aluminum in bauxite. The main source 
of gallium is as a by-product of aluminum refining. At 19 ppm of the earth's 
crust, gallium is about as abundant as nitrogen, lithium and lead; it is twice 
as abundant as boron (9 ppm), but is more difficult to extract due to the lack 
of any major gallium-containing ore. Gallium always occurs in association 
either with zinc or germanium, its neighbors in the periodic table, or with 
aluminum in the same group. Thus, the highest concentrations (0.1 - 1%) 
are in the rare mineral germanite (a complex sulfide of Zn, Cu, Ge, and As); 
concentrations in sphalerite (ZnS), bauxite, or coal, are a hundred-fold less. 


Gallium pnictides 


Gallium's main use is in semiconductor technology. For example, GaAs and 
related compounds can convert electricity directly into coherent light (laser 
diodes) and is employed in electroluminescent light-emitting diodes 
(LED's); it is also used for doping other semiconductors and in solid-state 
devices such as heterojunction bipolar transistors (HBTs) and high power 
high speed metal semiconductor field effect transistors (MESFETs). The 
compound MgGa>O, is used in ultraviolet-activated powders as a brilliant 
green phosphor used in Xerox copying machines. Minor uses are as high- 
temperature liquid seals, manometric fluids and heat-transfer media, and for 
low-temperature solders. 


Undoubtedly the binary compounds of gallium with the most industrial 
interest are those of the Group 15 (V) elements, GaE (E = N, P, As, Sb). 
The compounds which gallium forms with nitrogen, phosphorus, arsenic, 
and antimony are isoelectronic with the Group 14 elements. There has been 
considerable interest, particularly in the physical properties of these 
compounds, since 1952 when Welker first showed that they had 
semiconducting properties analogous to those of silicon and germanium. 


Gallium phosphide, arsenide, and antimonide can all be prepared by direct 
reaction of the elements; this is normally done in sealed silica tubes or in a 
graphite crucible under hydrogen. Phase diagram data is hard to obtain in 
the gallium-phosphorus system because of loss of phosphorus from the bulk 
material at elevated temperatures. Thus, GaP has a vapor pressure of more 
than 13.5 atm at its melting point; as compared to 0.89 atm for GaAs. The 
physical properties of these three compounds are compared with those of 
the nitride in [link]. All three adopt the zinc blende crystal structure and are 
more highly conducting than gallium nitride. 


Property GaN GaP GaAs GaSb 
Mens > 1250 (dec) 1350 1240 712 
point (°C) 
Density 

3 ca. 6.1 4.138 9.3176 9.6137 
(g/cm”) 
Crystal Wiirtzite zinc zinc zinc 
structure blende blende blende 
Cell dimen. a= 3.187,c= a= a= a= 
(A)? 5.186 5.4505 9.6532 6.0959 
Renecuve 2.35 3.178 3.666 4.388 
index 
k (ohm7!cm 9 4an7 10°? - 10 
i 10° - 10 102 ia 6-13 
Band eae 3.44 2.24 1.424 0.71 


(ev) 


Physical properties of 13-15 compound semiconductors. a Values given for 
300 K. b Dependent on photon energy; values given for 1.5 eV incident 
photons. c Dependent on temperature; values given for 300 K. 


Gallium arsenide versus silicon 


Gallium arsenide is a compound semiconductor with a combination of 
physical properties that has made it an attractive candidate for many 
electronic applications. From a comparison of various physical and 
electronic properties of GaAs with those of Si ({link]) the advantages of 
GaAs over Si can be readily ascertained. Unfortunately, the many desirable 
properties of gallium arsenide are offset to a great extent by a number of 
undesirable properties, which have limited the applications of GaAs based 
devices to date. 


Properties GaAs Si 
Formula weight 144.63 28.09 
Crystal structure zinc blende diamond 
Lattice constant 5.6532 5.43095 
Melting point (°C) 1238 1415 
Density (g/cm?) 5.32 2.328 
Thermal conductivity (W/cm.K) 0.46 1.5 
Band gap (eV) at 300 K 1.424 1.12 


Intrinsic carrier conc. (cm™) 1.79 x 10° 1.45 x 10/0 


Intrinsic resistivity (ohm.cm) 108 2.3.x 10° 


Breakdown field (V/cm) 4x 10° 3x 10° 
Minority carrier lifetime (s) 10" 2.5x 10° 
Mobility (cm?/V.s) 8500 1500 


Comparison of physical and semiconductor properties of GaAs and Si. 


Band gap 


The band gap of GaAs is 1.42 eV; resulting in photon emission in the infra- 
red range. Alloying GaAs with Al to give Al,Ga,_,As can extend the band 
gap into the visible red range. Unlike Si, the band gap of GaAs is direct, 
i.e., the transition between the valence band maximum and conduction band 
minimum involves no momentum change and hence does not require a 
collaborative particle interaction to occur. Photon generation by inter-band 
radiative recombination is therefore possible in GaAs. Whereas in Si, with 
an indirect band-gap, this process is too inefficient to be of use. The ability 
to convert electrical energy into light forms the basis of the use of GaAs, 
and its alloys, in optoelectronics; for example in light emitting diodes 
(LEDs), solid state lasers (light amplification by the stimulated emission of 
radiation). 


A significant drawback of small band gap semiconductors, such as Si, is 
that electrons may be thermally promoted from the valence band to the 
conduction band. Thus, with increasing temperature the thermal generation 
of carriers eventually becomes dominant over the intentionally doped level 
of carriers. The wider band gap of GaAs gives it the ability to remain 
‘intentionally’ semiconducting at higher temperatures; GaAs devices are 
generally more stable to high temperatures than a similar Si devices. 


Carrier density 


The low intrinsic carrier density of GaAs in a pure (undoped) form 
indicates that GaAs is intrinsically a very poor conductor and is commonly 
referred to as being semi-insulating. This property is usually altered by 
adding dopants of either the p- (positive) or n- (negative) type. This semi- 
insulating property allows many active devices to be grown on a single 
substrate, where the semi-insulating GaAs provides the electrical isolation 
of each device; an important feature in the miniaturization of electronic 
circuitry, i.e., VLSI (very-large-scale-integration) involving over 100,000 
components per chip (one chip is typically between 1 and 10 mm square). 


Electron mobility 


The higher electron mobility in GaAs than in Si potentially means that in 
devices where electron transit time is the critical performance parameter, 
GaAs devices will operate with higher response times than equivalent Si 
devices. However, the fact that hole mobility is similar for both GaAs and 
Si means that devices relying on cooperative electron and hole movement, 
or hole movement alone, show no improvement in response time when 
GaAs based. 


Crystal growth 


The bulk crystal growth of GaAs presents a problem of stoichiometric 
control due the loss, by evaporation, of arsenic both in the melt and the 
growing crystal (> ca. 600 °C). Melt growth techniques are, therefore, 
designed to enable an overpressure of arsenic above the melt to be 
maintained, thus preventing evaporative losses. The loss of arsenic also 
negates diffusion techniques commonly used for wafer doping in Si 
technology; since the diffusion temperatures required exceed that of arsenic 
loss. 


Crystal Stress 


The thermal gradient and, hence, stress generated in melt grown crystals 
have limited the maximum diameter of GaAs wafers (currently 6" diameter 
compared to over 12" for Si), because with increased wafer diameters the 
thermal stress generated dislocation (crystal imperfections) densities 
eventually becomes unacceptable for device applications. 


Physical strength 


Gallium arsenide single crystals are very brittle, requiring that considerably 
thicker substrates than those employed for Si devices. 


Native oxide 


Gallium arsenide's native oxide is found to be a mixture of non- 
stoichiometric gallium and arsenic oxides and elemental arsenic. Thus, the 
electronic band structure is found to be severely disrupted causing a 
breakdown in 'normal' semiconductor behavior on the GaAs surface. As a 
consequence, the GaAs MISFET (metal-insulator-semiconductor-field- 
effect-transistor) equivalent to the technologically important Si based 
MOSFET (metal-oxide-semiconductor-field-effect-transistor) is, therefore, 
presently unavailable. 


The passivation of the surface of GaAs is therefore a key issue when 
endeavoring to utilize the FET technology using GaAs. Passivation in this 
discussion means the reduction in mid-gap band states which destroy the 
semiconducting properties of the material. Additionally, this also means the 
production of a chemically inert coating which prevents the formation of 
additional reactive states, which can effect the properties of the device. 
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Synthesis and Purification of Bulk Semiconductors 


Introduction 


The synthesis and purification of bulk polycrystalline semiconductor 
material represents the first step towards the commercial fabrication of an 
electronic device. This polycrystalline material is then used as the raw 
material for the formation of single crystal material that is processed to 
semiconductor wafers. The strong influence on the electric characteristics of 
a semiconductors exhibited by small amounts of some impurities requires 
that the bulk raw material be of very high purity (> 99.9999%). Although 
some level of purification is possible during the crystallization process it is 
important to use as high a purity starting material as possible. While a wide 
range of substrate materials are available from commercial vendors, silicon 
and GaAs represent the only large-scale commercial semiconductor 
substrates, and thus the discussion will be limited to the synthesis and 
purification of these materials. 


Silicon 


Following oxygen (46%), silicon (L. silicis flint) is the most abundant 
element in the earth's crust (28%). However, silicon does not occur in its 
elemental form, but as its oxide (SiO>) or as silicates. Sand, quartz, 
amethyst, agate, flint, and opal are some of the forms in which the oxide 
appears. Granite, hornblende, asbestos, feldspar, clay and mica, etc. are a 
few of the numerous silicate minerals. With such boundless supplies of the 
raw material, the costs associated with the production of bulk silicon is not 
one of abstraction and conversion of the oxide(s), but of purification of the 
crude elemental silicon. While 98% elemental silicon, known as 
metallurgical-grade silicon (MGS), is readily produced on a large scale, the 
requirements of extreme purity for electronic device fabrication require 
additional purification steps in order to produce electronic-grade silicon 
(EGS). Electronic-grade silicon is also known as semiconductor-grade 
silicon (SGS). In order for the purity levels to be acceptable for subsequent 
crystal growth and device fabrication, EGS must have carbon and oxygen 
impurity levels less than a few parts per million (ppm), and metal impurities 
at the parts per billion (ppb) range or lower. [link] and [link] give typical 


impurity concentrations in MGS and EGS, respectively. Besides the purity, 
the production cost and the specifications must meet the industry desires. 


Element Concentration Element Concentration 
(ppm) (ppm) 

aluminum 1000-4350 manganese 90-120 

boron 40-60 molybdenum < 20 

calcium 245-500 nickel 10-105 

chromium 50-200 phosphorus 20-50 

copper 15-45 titanium 140-300 

iron 1550-6500 vanadium 50-250 

magnesium 10-50 zirconium 20 


Typical impurity concentrations found in metallurgical-grade silicon (MGS). 


Elenite Concentration Element Concentration 
(ppb) (ppb) 
pene < 0.001 gold < 0.00001 


antimony < 0.001 iron 0.1-1.0 


boron < 0.1 nickel 0.1-0.5 


carbon 100-1000 oxygen 100-400 
chromium < 0.01 phosphorus < 0.3 
cobalt 0.001 silver 0.001 
copper 0.1 zinc <0.1 


Typical impurity concentrations found in electronic-grade silicon (EGS). 


Metallurgical-grade silicon (MGS) 


The typical source material for commercial production of elemental silicon 
is quartzite gravel; a relatively pure form of sand (SiO>). The first step in the 
synthesis of silicon is the melting and reduction of the silica in a submerged- 
electrode arc furnace. An example of which is shown schematically in 
[link], along with the appropriate chemical reactions. A mixture of quartzite 
gravel and carbon are heated to high temperatures (ca. 1800 °C) in the 
furnace. The carbon bed consists of a mixture of coal, coke, and wood chips. 
The latter providing the necessary porosity such that the gases created 
during the reaction (SiO and CO) are able to flow through the bed. 


quartzite, coal submerged electrode 


coke, wood chips CO, SiO, H,O 
naan ee ae 
alias Si0+2C >" 
form SiC 
from SiO and C 
melt SiO, KG co 


te, 


SiC + SiO, > Si+ SiO +CO 


li id ili - 
re discharge of MGSC—> 


Schematic of submerged-electrode arc furnace 
for the production of metallurgical-grade 
silicon (MGS). 


The overall reduction reaction of SiO, is expressed in [link], however, the 
reaction sequence is more complex than this overall reaction implies, and 
involves the formation of SiC and SiO intermediates. The initial reaction 
between molten SiO» and C ([link]) takes place in the arc between adjacent 
electrodes, where the local temperature can exceed 2000 °C. The SiO and 
CO thus generated flow to cooler zones in the furnace where SiC is formed 
({link]), or higher in the bed where they reform SiO, and C ([link]). The SiC 
reacts with molten SiO> ({link]) producing the desired silicon along with 
SiO and CO. The molten silicon formed is drawn-off from the furnace and 


solidified. 
Equation: 


SiO,(liquid) + 2 C(solid) > Si(liquid) + 2 CO (gas) 


Equation: 


>1700 °C 
Si0Q,+2C == Si0+CO 
<1600 °C 


Equation: 
SiO0+2C > SiC +CO (1600 - 1700 °C) 
Equation: 


SiC + SiO, > Si+ SiO + CO 


The as-produced MGS is approximately 98-99% pure, with the major 
impurities being aluminum and iron ({link]), however, obtaining low levels 
of boron impurities is of particular importance, because it is difficult to 
remove and serves as a dopant for silicon. The drawbacks of the above 
process are that it is energy and raw material intensive. It is estimated that 
the production of one metric ton (1,000 kg) of MGS requires 2500-2700 kg 
quartzite, 600 kg charcoal, 600-700 kg coal or coke, 300-500 kg wood chips, 
and 500,000 kWh of electric power. Currently, approximately 500,000 
metric tons of MGS are produced per year, worldwide. Most of the 
production (ca. 70%) is used for metallurgical applications (e.g., aluminum- 
silicon alloys are commonly used for automotive engine blocks) from 
whence its name is derived. Applications in a variety of chemical products 
such as silicone resins account for about 30%, and only 1% or less of the 
total production of MGS is used in the manufacturing of high-purity EGS 
for the electronics industry. The current worldwide consumption of EGS is 
approximately 5 x 10° kg per year. 


Electronic-grade silicon (EGS) 


Electronic-grade silicon (EGS) is a polycrystalline material of exceptionally 
high purity and is the raw material for the growth of single-crystal silicon. 
EGS is one of the purest materials commonly available, see [link]. The 
formation of EGS from MGS is accomplished through chemical purification 


processes. The basic concept of which involves the conversion of MGS to a 
volatile silicon compound, which is purified by distillation, and 
subsequently decomposed to re-form elemental silicon of higher purity (i.e., 
EGS). Irrespective of the purification route employed, the first step is 
physical pulverization of MGS followed by its conversion to the volatile 
silicon compounds. 


A number of compounds, such as monosilane (SiH,), dichlorosilane 
(SiH»Cl)), trichlorosilane (SiHCl3), and silicon tetrachloride (SiCl,), have 
been considered as chemical intermediates. Among these, SiHCl3 has been 
used predominantly as the intermediate compound for subsequent EGS 
formation, although SiH, is used to a lesser extent. Silicon tetrachloride and 
its lower chlorinated derivatives are used for the chemical vapor deposition 
(CVD) growth of Si and SiO>. The boiling points of silane and its 
chlorinated products ([link]) are such that they are conveniently separated 
from each other by fractional distillation. 


Compound Boiling point (°C) 
SiH, -112.3 

SiH3Cl -30.4 

SiH>Cl> 8.3 

SiHCl3 31.5 

SiCl, 57.6 


Boiling points of silane and chlorosilanes at 760 mmHg (1 atmosphere). 


The reasons for the predominant use of SiHCl3 in the synthesis of EGS are 
as follows: 


1. SiHCl; can be easily formed by the reaction of anhydrous hydrogen 
chloride with MGS at reasonably low temperatures (200 - 400 °C); 

2. it is liquid at room temperature so that purification can be 
accomplished using standard distillation techniques; 

3. it is easily handled and if dry can be stored in carbon steel tanks; 

4. its liquid is easily vaporized and, when mixed with hydrogen it can be 
transported in steel lines without corrosion; 

5. it can be reduced at atmospheric pressure in the presence of hydrogen; 
6. its deposition can take place on heated silicon, thus eliminating contact 
with any foreign surfaces that may contaminate the resulting silicon; 

and 
7. it reacts at lower temperatures (1000 - 1200 °C) and at faster rates than 
does SiCly. 


Chlorosilane (Seimens) process 


Trichlorosilane is synthesized by heating powdered MGS with anhydrous 
hydrogen chloride (HCl) at around 300 °C in a fluidized-bed reactor, [link]. 
Equation: 


ca. 300 °C 
Si(solid) + 3 HCl(gas) == SiHCI,(vapor) + H, (gas) 
>900 °C 


Since the reaction is actually an equilibrium and the formation of SiHCl3 
highly exothermic, efficient removal of generated heat is essential to assure 
a maximum yield of SiHCl3. While the stoichiometric reaction is that shown 
in Eq. 5, a mixture of chlorinated silanes is actually prepared which must be 
separated by fractional distillation, along with the chlorides of any 
impurities. In particular iron, aluminum, and boron are removed as FeCl; 
(b.p. = 316 °C), AICI, (m.p. = 190 °C subl.), and BCI (b.p. = 12.65 °C), 
respectively. Fractional distillation of SiHCl3 from these impurity halides 
result in greatly increased purity with a concentration of electrically active 
impurities of less than 1 ppb. 


EGS is prepared from purified SiHCl3 in a chemical vapor deposition 
(CVD) process similar to the epitaxial growth of Si. The high-purity SiHCl3 
is vaporized, diluted with high-purity hydrogen, and introduced into the 
Seimens deposition reactor, shown schematically in [link]. Within the 
reactor, thin silicon rods called slim rods (ca. 4 mm diameter) are supported 
by graphite electrodes. Resistance heating of the slim rods causes the 
decomposition of the SiHCl3 to yield silicon, as described by the reverse 
reaction shown in Eq. 5. 


<—— reaction chamber 


Si-bridge 


Si-slim rod 


Schematic representation of a Seimens 
deposition reactor. 


The shift in the equilibrium from forming SiHCl; from Si at low 
temperature, to forming Si from SiHC]l; at high temperature is as a 
consequence of the temperature dependence ({link]) of the equilibrium 
constant ([link], where p = partial pressure) for [link]. Since the formation of 
SiHCl3 is exothermic, i.e., AH < 0, an increase in the temperature causes the 
partial pressure of SiHCls to decrease. Thus, the Siemens process is 
typically run at ca. 1100 °C, while the reverse fluidized bed process is 
carried out at 300 °C. 

Equation: 


InK, = -AH 
RT 


Equation: 


Psincl, °H, 


PHC 


The slim rods act as a nucleation point for the deposition of silicon, and the 
resulting polycrystalline rod consists of columnar grains of silicon 
(polysilicon) grown perpendicular to the rod axis. Growth occurs at less than 
1 mm per hour, and after deposition for 200 to 300 hours high-purity (EGS) 
polysilicon rods of 150-200 mm in diameter are produced. For subsequent 
float-zone refining the polysilicon EGS rods are cut into long cylindrical 
rods. Alternatively, the as-formed polysilicon rods are broken into chunks 
for single crystal growth processes, for example Czochralski melt growth. 


In addition to the formation of silicon, the HCl] coproduct reacts with the 
SiHCl3 reactant to form silicon tetrachloride (SiCl4) and hydrogen as major 
byproducts of the process, [link]. This reaction represents a major 
disadvantage with the Seimens process: poor efficiency of silicon and 
chlorine consumption. Typically, only 30% of the silicon introduced into 
CVD reactor is converted into high-purity polysilicon. 

Equation: 


HCl + SiHCl, > SiC, +H, 


In order to improve efficiency the HCl, SiCl,, H», and unreacted SiHCl3 are 
separated and recovered for recycling. [link] illustrates the entire 
chlorosilane process starting with MGS and including the recycling of the 
reaction byproducts to achieve high overall process efficiency. As a 
consequence, the production cost of high-purity EGS depends on the 
commercial usefulness of the byproduct, SiCl,. Additional disadvantages of 
the Seimens process are derived from its relatively small batch size, slow 


growth rate, and high power consumption. These issues have lead to the 
investigation of alternative cost efficient routes to EGS. 


Si(MGS) HC] SiC, HCl 


hydrochlorination chlorosilane| Hz, 
of Si (MGS) recovery 
fluidized bed 


reactor 
SiHCl, 


hydrogen 
and HCl 
revovery 


SiHCl, 
(SIH, Cly.x) 


H2 


SiHCls 
vaporization H> 
and chemical vapor 
deposition 


SiHCls 


distillation 


low boiling SiCly 
impurities 
Si (EGS) 


Schematic representation of the reaction pathways 
for the formation of EGS using the chlorosilane 
process. 


Silane process 


An alternative process for the production of EGS that has begun to receive 
commercial attention is the pyrolysis of silane (SiH). The advantages of 
producing EGS from SiH, instead of SiHCls are potentially lower costs 
associated with lower reaction temperatures, and less harmful byproducts. 
Silane decomposes < 900 °C to give silicon and hydrogen, [Link]. 
Equation: 


SiH,(vapor) > Si(solid) + 2 H, (gas) 


Silane may be prepared by a number of routes, each having advantages with 
respect to purity and production cost. The simplest process involves the 
direct reaction of MGS powders with magnesium at 500 °C in a hydrogen 
atmosphere, to form magnesium silicide (Mg»Si). The magnesium silicide is 
then reacted with ammonium chloride in liquid ammonia below 0 °C, [link]. 
Equation: 


Mg,Si+4NH,Cl > SiH, +2 MgCl, +5 NH, 


This process is ideally suited to the removal of boron impurities (a p-type 
dopant in Si), because the diborane (B>H¢) produced during the reaction 
forms the Lewis acid-base complex, H3B(NH3), whose volatility is 
sufficiently lower than SiHy, allowing for the purification of the latter. It is 
possible to prepare EGS with a boron content of < 20 ppt using SiH, 
synthesized in this manner. However, phosphorus (another dopant) in the 
form of PH3 may be present as a contaminant requiring subsequent 
purification of the SiHy. 


Alternative routes to SiH, involve the chemical reduction of SiCl, by either 
lithium hydride ([link]), lithium aluminum hydride ([link]), or via 
hydrogenation in the presence of elemental silicon ([link] - [link]). The 
hydride reduction reactions may be carried-out on relatively large scales (ca. 
50 kg), but only batch processes. In contrast, Union Carbide has adapted the 
hydrogenation to a continuous process, involving disproportionation 
reactions of chlorosilanes ([link] - [link]) and the fractional distillation of 
silane ({link]). 

Equation: 


SiCl,+4LiH > SiH, +4 LiCl 
Equation: 
SiCl, +4 LiAIH, > SiH,+LiCl + AICI, 


Equation: 


SiCl, +2 H, + Si(98%) > 4 SiHCI, 
Equation: 

2 SiHCI, > SiH,Cl, + SiCl, 
Equation: 

3 SiH,Cl, > SiH,CI +2 SiHCI, 
Equation: 


2 SiH,Cl > SiH, + SiH,Cl, 


Pyrolysis of silane on resistively heated polysilicon filaments at 700-800 °C 
yields polycrystalline EGS. As noted above, the EGS formed has 
remarkably low boron impurities compared with material prepared from 
trichlorosilane. Moreover, the resulting EGS is less contaminated with 
transition metals from the reactor container because SiH, decomposition 
does not cause as much of a corrosion problem as is observed with halide 
precursor compounds. 


Granular polysilicon deposition 


Both the chlorosilane (Seimens) and silane processes result in the formation 
of rods of EGS. However, there has been increased interest in the formation 
of granular polycrystalline EGS. This process was developed in 1980’s, and 
relies on the decomposition of SiH, in a fluidized-bed deposition reactor to 

produce free-flowing granular polysilicon. 


Tiny silicon particles are fluidized in a SiH4/Hp flow, and act as seed crystal 
onto which polysilicon deposits to form free-flowing spherical particles. The 
size distribution of the particles thus formed is over the range from 0.1 to 1.5 
mm in diameter with an average particle size of 0.7 mm. The fluidized-bed 


seed particles are originally made by grinding EGS in a ball (or hammer) 
mill and leaching the product with acid, hydrogen peroxide, and water. This 
process is time-consuming and costly, and tended to introduce undesirable 
impurities from the metal grinders. In a new method, large EGS particles are 
fired at each other by a high-speed stream of inert gas and the collision 
breaks them down into particles of suitable size for a fluidized bed. This 
process has the main advantage that it introduces no foreign materials and 
requires no leaching or other post purification. 


The fluidized-bed reactors are much more efficient than traditional rod 
reactors as a consequence of the greater surface area available during CVD 
growth of silicon. It has been suggested that fluidized-bed reactors require 
‘7, to /19 the energy, and half the capital cost of the traditional process. The 
quality of fluidized-bed polysilicon has proven to be equivalent to 
polysilicon produced by the conventional methods. Moreover, granular EGS 
in a free-flowing form, and with high bulk density, enables crystal growers 
to obtain the high, reproducible production yields out of each crystal growth 
run. For example, in the Czochralski crystal growth process, crucibles can 
be quickly and easily filled to uniform loading with granular EGS, which 
typically exceed those of randomly stacked polysilicon chunks produced by 
the Siemens silane process. 


Zone refining 


The technique of zone refining is used to purify solid materials and is 
commonly employed in metallurgical refining. In the case of silicon may be 
used to obtain the desired ultimate purity of EGS, which has already been 
purified by chemical processes. Zone refining was invented by Pfann, and 
makes use of the fact that the equilibrium solubility of any impurity (e.g., 
Al) is different in the solid and liquid phases of a material (e.g., Si). For the 
dilute solutions, as is observed in EGS silicon, an equilibrium segregation 
coefficient (kp) is defined by kg = C./C), where C, and C; are the equilibrium 
concentrations of the impurity in the solid and liquid near the interface, 
respectively. 


If kp is less than 1 then the impurities are left in the melt as the molten zone 
is moved along the material. In a practical sense a molten zone is established 
in a solid rod. The zone is then moved along the rod from left to right. If k < 
1 then the frozen part left on the trailing edge of the moving molten zone 
will be purer than the material that melts in on the right-side leading edge of 
the moving molten zone. Consequently the solid to the left of the molten 
zone is purer than the solid on the right. At the completion of the first pass 
the impurities become concentrated to the right of the solid sample. 
Repetition of the process allows for purification to exceptionally high levels. 
[link]. lists the equilibrium segregation coefficients for common impurity 
and dopant elements in silicon; it should be noted that they are all less than 
1. 


Element ko Element ko 
aluminum 0.002 iron 8x 10° 
boron 0.8 oxygen 0.25 
carbon 0.07 phosphorus 0.35 
copper 4x 10° antimony 0.023 


Segregation coefficients for common impurity and dopant elements in 
silicon. 


Gallium arsenide 


In contrast to electronic grade silicon (EGS), whose use is a minor fraction 
of the global production of elemental silicon, gallium arsenide (GaAs) is 
produced exclusively for use in the semiconductor industry. However, 
arsenic and its compounds have significant commercial applications. The 


main use of elemental arsenic is in alloys of Pb, and to a lesser extent Cu, 
while arsenic compounds are widely used in pesticides and wood 
preservatives and the production of bottle glass. Thus, the electronics 
industry represents a minor user of arsenic. In contrast, although gallium has 
minor uses as a high-temperature liquid seal, manometric fluids and heat 
transfer media, and for low temperature solders, its main use is in 
semiconductor technology. 


Isolation and purification of gallium metal 


At 19 ppm gallium (L. Gallia, France) is about as abundant as nitrogen, 
lithium and lead; it is twice as abundant as boron (9 ppm), but is more 
difficult to extract due to the lack of any major gallium-containing ore. 
Gallium always occurs in association either with zinc or germanium, its 
neighbors in the periodic table, or with aluminum in the same group. Thus, 
the highest concentrations (0.1-1%) are in the rare mineral germanite (a 
complex sulfide of Zn, Cu, Ge, and As), while concentrations in sphalerite 
(ZnS), diaspore [AlO(OH)], bauxite, or coal, are a hundred-fold less. 
Industrially, gallium was originally recovered from the flue dust emitted 
during sulfide roasting or coal burning (up to 1.5% Ga), however, it is now 
obtained as side product of vast aluminum industry and in particular from 
the Bayer process for obtaining alumina from bauxite. 


The Bayer process involves dissolution of bauxite, AlIO,OH3_>,, in aqueous 
NaOH, separation of insoluble impurities, partial precipitation of the 
trihydrate, Al(OH)3, and calcination at 1,200 °C. During processing the 
alkaline solution is gradually enriched in gallium from an initial weight ratio 
Ga/Al of about 1/5000 to about 1/300. Electrolysis of these extracts with a 
Hg cathode results in further concentration, and the solution of sodium 
gallate thus formed is then electrolyzed with a stainless steel cathode to give 
Ga metal. Since bauxite contains 0.003-0.01% gallium, complete recovery 
would yield some 500-1000 tons per annum, however present consumption 
is only 0.1% of this about 10 tons per annum. 


A typical analysis of the 98-99% pure gallium obtained as a side product 
from the Bayer process is shown in [link]. This material is further purified to 


99.99% by chemical treatment with acids and O> at high temperatures 
followed by crystallization. This chemical process results in the reduction of 
the majority of metal impurities at the ppm level, see [link]. Purification to 
seven nines 99.9999% is possible through zone refining, however, since the 
equilibrium distribution coefficient of the residual impurities kp ~ 1, 
multiple passes are required, typically > 500. The low melting point of 
gallium ensures that contamination from the container wall (which is 
significant in silicon zone refining) is minimized. In order to facilitate the 
multiple zone refining in a suitable time, a simple modification of zone 
refining is employed shown in [link]. The gallium is contained in a plastic 
tube wrapped around a rotating cylinder that is half immersed in a cooling 
bath. A heater is positioned above the gallium plastic coil. Thus, establishing 
a series of molten zones that pass upon rotation of the drum by one helical 
segment per revolution. In this manner, 500 passes may be made in 
relatively short time periods. The typical impurity levels of gallium zone 
refined in this manner are given in [link]. 


Element 


aluminum 
calcium 
copper 
iron 

lead 


magnesium 


Bayer 
process 
(ppm) 
100-1,000 
10-100 
100-1,000 
100-1,000 
< 2000 


10-100 


After acid/base 
leaching (ppm) 


7 
not detected 
2 
7 


30 


500 zone 
Passes 


(ppm) 

<1 

not detected 
<1 

| 

not detected 


not detected 


mercury 


nickel 


silicon 


tin 


titanium 


zinc 


10-100 
10-100 
10-100 
10-100 
10-100 


30,000 


not detected not detected 


not detected not detected 
x1 not detected 
x1] not detected 
1 <1 

x] not detected 


Typical analysis of gallium obtained as a side product from the Bayer 


process. 


heater . 
gallium 
contained ina 


rotating drum 


plastic tube 


Schematic representation of a zone refining 


apparatus. 


Isolation and purification of elemental arsenic 


Elemental arsenic (L. arsenicum, yellow orpiment) exists in two forms: 
yellow (cubic, As,) and gray or metallic (rhombohedral). At a natural 
abundance of 1.8 ppm arsenic is relatively rare, however, this is offset by its 
presence in a number of common minerals and the relative ease of isolation. 


Arsenic containing minerals are grouped into three main classes: the sulfides 
realgar (As,4S,) and orpiment (As>S3), the oxide arsenolite (As,O3), and the 
arsenides and sulfaresenides of the iron, cobalt, and nickel. Minerals in this 
latter class include: loellinginite (FeAs>), safforlite (CoAs), niccolite (NiAs), 
rammelsbergite (NiAs>), ansenopyrite or mispickel (FeAsS), cobaltite 
(CoAsS), enargite (Cu3AsS,), gerdsorfite (NiAsS), and the quarturnary 
sulfide glaucodot [(Co,Fe)AsS]. [link] shows the typical impurities in 
arsenopyrite. 


Element Concentration Element Concentration 
(ppm) (ppm) 

silver 90 nickel < 3,000 

gold 8 lead 50 

cobalt 30,000 platinum 0.4 

copper 200 rhenium 50 

germanium 30 selenium 50 

manganese 3,000 vanadium 300 

molybdenum 60 zinc 400 


Typical impurities in arsenopyrite. 


Arsenic is obtained commercially by smelting either FeAs» or FeAsS at 650- 
700 °C in the absence of air and condensing the sublimed element (Ts,4 = 
613 °C), [link]. 

Equation: 


650-700 °C <613 °C 
FeAsS > FeS+As(vapor) >  As(solid) 


The arsenic thus obtained is combined with lead and then sublimed (T,,, = 
614 °C) which binds any sulfur impurities more strongly than arsenic. Any 
residual arsenic that remains trapped in the iron sulfide is separated by 
forming the oxide (As»O3) by roasting the sulfide in air. The oxide is 
sublimed into the flue system during roasting from where it is collected and 
reduced with charcoal at 700-800 °C to give elemental arsenic. 
Semiconductor grade arsenic (> 99.9999%) is formed by zone refining. 


Synthesis and purification of gallium arsenide. 


Gallium arsenide can be prepared by the direct reaction of the elements, 
[link]. However, while conceptually simple the synthesis of GaAs is 
complicated by the different vapor pressures of the reagents and the highly 
exothermic nature of the reaction. Furthermore, since the synthesis of GaAs 
at atmospheric pressure is accompanied by its simultaneous decomposes due 
to the loss by sublimation, of arsenic, the synthesis must be carried out 
under an overpressure of arsenic in order to maintain a stoichiometric 
composition of the synthesized GaAs. 

Equation: 


>1240 °C 
Ga(liquid) + As(vapor) > GaAs(solid) 


In order to overcome the problems associated with arsenic loss, the reaction 
is usually carried out in a sealed reaction tube. However, if a stoichiometric 
quantity of arsenic is used in the reaction a constant temperature of 1238 °C 
must be employed in order to maintain the desired arsenic overpressure of 1 
atm. Practically, it is easier to use a large excess of arsenic heated to a lower 
temperature. In this situation the pressure in the tube is approximately equal 
to the equilibrium vapor pressure of the volatile component (arsenic) at the 
lower temperature. Thus, an over pressure of 1 atm arsenic may be 


maintained if within a sealed tube elemental arsenic is heated to 600-620 °C 
while the GaAs is maintained at 1240-1250 °C. 


[link] shows the sealed tube configuration that is typically used for the 
synthesis of GaAs. The tube is heated within a two-zone furnace. The boats 
holding the reactants are usually made of quartz, however, graphite is also 
used since the latter has a closer thermal expansion match to the GaAs 
product. If higher purity is required then pyrolytic boron nitride (PBN) is 
used. One of the boats is loaded with pure gallium the other with arsenic. A 
plug of quartz wool may be placed between the boats to act as a diffuser. 
The tube is then evacuated and sealed. Once brought to the correct reaction 
temperatures ([link]), the arsenic vapor is transported to the gallium, and 
they react to form GaAs in a controlled manner. [link] gives the typical 
impurity concentrations found in polycrystalline GaAs. 
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600 - 620 °C 1240 - 1260 °C 


Schematic representation of a sealed tube 


synthesis of GaAs. 
Ficmeni Concentration Ficnient Concentration 
(ppm) (ppm) 


boron 0.1 silicon 0.02 


carbon 0.7 phosphorus 0.1 


nitrogen 0.1 sulfur 0.01 
oxygen 0.5 chlorine 0.08 
fluorine 0.2 nickel 0.04 
magnesium 0.02 copper 0.01 
aluminum 0.02 zinc 0.05 


Impurity concentrations found in polycrystalline GaAs. 


Polycrystalline GaAs, formed in from the direct reaction of the elements is 
often used as the starting material for single crystal growth via Bridgeman or 
Czochralski crystal growth. It is also possible to prepare single crystals of 
GaAs directly from the elements using in-situ, or direct, compounding 
within a high-pressure liquid encapsulated Czochralski (HPLEC) technique. 
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Growth of Gallium Arsenide Crystals 


Introduction 


When considering the synthesis of Group 13-15 compounds for electronic 
applications, the very nature of semiconductor behavior demands the use of 
high purity single crystal materials. The polycrystalline materials 
synthesized above are, therefore, of little use for 13-15 semiconductors but 
may, however, serve as the starting material for melt grown single crystals. 
For GaAs, undoubtedly the most important 13-15 (III - V) semiconductor, 
melt grown single crystals are achieved by one of two techniques: the 
Bridgman technique, and the Czochralski technique. 


Bridgman growth 


The Bridgman technique requires a two-zone furnace, of the type shown in 
[link]. The left hand zone is maintained at a temperature of ca. 610 °C, 
allowing sufficient overpressure of arsenic within the sealed system to 
prevent arsenic loss from the gallium arsenide. The right hand side of the 
furnace contains the polycrystalline GaAs raw material held at a 
temperature just above its melting point (ca. 1240 °C). As the furnace 
moves from left to right, the melt cools and solidifies. If a seed crystal is 
placed at the left hand side of the melt (at a point where the temperature 
gradient is such that only the end melts), a specific orientation of single 
crystal may be propagated at the liquid-solid interface eventually to produce 
a single crystal. 


furnace zone 1 furnace zone 2 


; seed GaAs 
arsenic crystal charge 


Direction of furnace travel 


A schematic diagram of a Bridgman two-zone 
furnace used for melt growths of single crystal 
GaAs. 


Czochralski growth 


The Czochralski technique, which is the most commonly used technique in 
industry, is shown in [link]. The process relies on the controlled withdrawal 
of a seed crystal from a liquid melt. As the seed is lowered into the melt, 
partial melting of the tip occurs creating the liquid solid interface required 
for crystal growth. As the seed is withdrawn, solidification occurs and the 
seed orientation is propagated into the grown material. The variable 
parameters of rate of withdrawal and rotation rate can control crystal 
diameter and purity. As shown in [link] the GaAs melt is capped by boron 
trioxide (B03). The capping layer, which is inert to GaAs, prevents arsenic 
loss when the pressure on the surface is above atmospheric pressure. The 
growth of GaAs by this technique is thus termed liquid encapsulated 
Czochralski (LEC) growth. 
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A schematic diagram of the Czochralski 
technique as used for growth of GaAs single 
crystal bond. 


While the Bridgman technique is largely favored for GaAs growth, larger 
diameter wafers can be obtained by the Czochralski method. Both of these 
melt techniques produce materials heavily contaminated by the crucible, 
making them suitable almost exclusively as substrate material. Another 
disadvantage of these techniques is the production of defects in the material 
caused by the melt process. 
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Ceramic Processing of Alumina 


Introduction 


While aluminum is the most abundant metal in the earth's crust (ca. 8%) 
and aluminum compounds such as alum, K[AI(SO,)].12(H2O), were 
known throughout the world in ancient times, it was not until the isolation 
of aluminum in the late eighteenth century by the Danish scientist H. C. 
Oersted that research into the chemistry of the Group 13 elements began in 
earnest. Initially, metallic aluminum was isolated by the reduction of 
aluminum trichloride with potassium or sodium; however, with the advent 
of inexpensive electric power in the late 1800's, it became economically 
feasible to extract the metal via the electrolyis of alumina (Al,O3) dissolved 
in cryolite, Na3AlF¢, (the Hall-Heroult process). Today, alumina is prepared 
by the Bayer process, in which the mineral bauxite (named for Les Baux, 
France, where it was first discovered) is dissolved with aqueous hydroxides, 
and the solution is filtered and treated with CO> to precipitate alumina. 
With availability of both the mineral and cheap electric power being the 
major considerations in the economical production of aluminum, it is not 
surprising that the leading producers of aluminum are the United States, 
Japan, Australia, Canada, and the former Soviet Union. 


Aluminum oxides and hydroxides 


The many forms of aluminum oxides and hydroxides are linked by complex 
structural relationships. Bauxite has the formula Al,(OH)3.9, (0 < x < 1) 
and is thus a mixture of Al5O3 (a-alumina), Al(OH) 3 (gibbsite), and 
AlO(OH) (boehmite). The latter is an industrially important compound 
which is used in the form of a gel as a pre-ceramic in the production of 
fibers and coatings, and as a fire retarding agent in plastics. 


Heating boehmite and diaspore to 450 °C causes dehydration to yield forms 
of alumina which have structures related to their oxide-hydroxide 
precursors. Thus, boehmite produces the low-temperature form y-alumina, 
while heating diaspore will give a-alumina (corundum). y-alumina converts 
to the hcp structure at 1100 °C. A third form of Al»O3 forms on the surface 
of the clean aluminum metal. The thin, tough, transparent oxide layer is the 


reason for much of the usefulness of aluminum. This oxide skin is rapidly 
self-repairing because its heat of formation is so large (AH = -3351 kJ/mol). 
Equation: 


4Al +30, > 2A1,0, 


Ternary and mixed-metal oxides 


A further consequence of the stability of alumina is that most if not all of 
the naturally occurring aluminum compounds are oxides. Indeed, many 
precious gemstones are actually corundum doped with impurities. 
Replacement of aluminum ions with trace amounts of transition-metal ions 
transforms the formerly colorless mineral into ruby (red, Cr°*), sapphire 
(blue, Fe**/3*, Ti**), or topaz (yellow, Fe**). The addition of stoichiometric 
amounts of metal ions causes a shift from the a-Al,O3 hcp structure to the 
other common oxide structures found in nature. Examples include the 
perovskite structure for ABO3 type minerals (e.g., CeTiO7 or LaAlO3) and 
the spinel structure for AB»O, minerals (e.g., beryl, BeAl Ox). 


Aluminum oxide also forms ternary and mixed-metal oxide phases. Ternary 
systems such as mullite (AlgSi,O;3), yttrium aluminum garnet (YAG, 
Y3AI150 >), the B-aluminas (e.g., NaAl,;;0,7) and aluminates such as 
hibonite (CaAl,»019) possessing B-alumina or magnetoplumbite-type 
structures can offer advantages over those of the binary aluminum oxides. 


Applications of these materials are found in areas such as engineering 
composite materials, coatings, technical and electronic ceramics, and 
catalysts. For example, mullite has exceptional high temperature shock 
resistance and is widely used as an infrared-transparent window for high 
temperature applications, as a substrate in multilayer electronic device 
packaging, and in high temperature structural applications. Hibonite and 
other hexaluminates with similar structures are being evaluated as 
interfacial coatings for ceramic matrix composites due to their high thermal 
stability and unique crystallographic structures. Furthermore, aluminum 
oxides doped with an alkali, alkaline earth, rare earth, or transition metal are 


of interest for their enhanced chemical and physical properties in 
applications utilizing their unique optoelectronic properties. 


Synthesis of aluminum oxide ceramics 


In common with the majority of oxide ceramics, two primary synthetic 
processes are employed for the production of aluminum oxide and mixed 
metal oxide materials: 


1. The traditional ceramic powder process. 
2. The solution-gelation, or "sol-gel" process. 


The environmental impact of alumina and alumina-based ceramics is in 
general negligible; however, the same cannot be said for these methods of 
preparation. As practiced commercially, both of the above processes can 
have a significant detrimental environmental impact. 


Traditional ceramic processing 


Traditional ceramic processing involves three basic steps generally referred 
to as powder-processing, shape-forming, and densification, often with a 
final mechanical finishing step. Although several steps may be energy 
intensive, the most direct environmental impact arises from the shape- 
forming process where various binders, solvents, and other potentially toxic 
agents are added to form and stabilize a solid ("green") body ({link]). 


Volume 


Function Composition (%) 


Powder alumina (Al,O3) 27 


Solvent 1,1,1-trichloroethane/ethanol 58 
Deflocculant menhaden oil 1.8 
Binder poly(vinyl butyrol) 4.4 


poly(ethylene glycol)/octyl 


phthalate oe 


Plasticizer 


Typical composition of alumina green body 


The component chemicals are mixed to a slurry, cast, then dried and fired. 
In addition to any innate health risk associated with the chemical processing 
these agents are subsequently removed in gaseous form by direct 
evaporation or pyrolysis. The replacement of chlorinated solvents such as 
1,1,1-trichloroethylene (TCE) must be regarded as a high priority for 
limiting environmental pollution. The United States Environmental 
Protection Agency (EPA) included TCE on its 1991 list of 17 high-priority 
toxic chemicals targeted for source reduction. The plasticizers, binders, and 
alcohols used in the process present a number of potential environmental 
impacts associated with the release of combustion products during firing of 
the ceramics, and the need to recycle or discharge alcohols which, in the 
case of discharge to waterways, may exert high biological oxygen demands 
in the receiving communities. It would be desirable, therefore, to be able to 
use aqueous processing; however, this has previously been unsuccessful due 
to problems associated with batching, milling, and forming. Nevertheless, 
with a suitable choice of binders, etc., aqueous processing is possible. 
Unfortunately, in many cast-parts formed by green body processing the 
liquid solvent alone consists of over 50 % of the initial volume, and while 
this is not directly of an environmental concern, the resultant shrinkage 
makes near net shape processing difficult. 


Sol-gel 


Whereas the traditional sintering process is used primarily for the 
manufacture of dense parts, the solution-gelation (sol-gel) process has been 


applied industrially primarily for the production of porous materials and 
coatings. 


Sol-gel involves a four stage process: dispersion, gelation, drying, and 
firing. A stable liquid dispersion or sol of the colloidal ceramic precursor is 
initially formed in a solvent with appropriate additives. By changing the 
concentration (aging) or pH, the dispersion is "polymerized" to form a solid 
dispersion or gel. The excess liquid is removed from this gel by drying and 
the final ceramic is formed by firing the gel at higher temperatures. 


The common sol-gel route to aluminum oxides employs aluminum 
hydroxide or hydroxide-based material as the solid colloid, the second 
phase being water and/or an organic solvent, however, the strong 
interactions of the freshly precipitated alumina gels with ions from the 
precursor solutions makes it difficult to prepare these gels in pure form. To 
avoid this complication, alumina gels are also prepared from the hydrolysis 
of aluminum alkoxides, Al(OR)3. 

Equation: 


AI(OR), + H,O/Ht > Al-gel 


Equation: 


A 
Al-gel > ALO, 


The exact composition of the gel in commercial systems is ordinarily 
proprietary, however, a typical composition will include an aluminum 
compound, a mineral acid, and a complexing agent to inhibit premature 
precipitation of the gel, e.g., [link]. 


Function Composition 


Boehmite precursor ASB [aluminum sec-butoxide, Al(OC4Ho)3] 
Electrolyte HNO3 0.07 mole/mole ASB 
Complexing agent glycerol ca. 10 wt.% 


Typical composition of an alumina sol-gel for slipcast ceramics. 


The principal environmental consequences arising from the sol-gel process 
are those associated with the use of strong acids, plasticizers, binders, 
solvents, and sec-butanol formed during the reaction. Depending on the 
firing conditions, variable amounts of organic materials such as binders and 
plasticizers may be released as combustion products. NO,’s may also be 
produced in the off-gas from residual nitric acid or nitrate salts. Moreover, 
acids and solvents must be recycled or disposed of. Energy consumption in 
the process entails “upstream” environmental emissions associated with the 
production of that energy. 
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Piezoelectric Materials Synthesis 


This module was developed as part of the Rice University course CHEM-496: Chemistry of Electronic 
Materials. This module was prepared with the assistance of Ilse Y. Guzman-Jimenez. 


Introduction 


Piezoelectricity is the generation of an electric moment by a change of stress applied to a solid. The word 
piezoelectricity literally means “pressure electricity”; the prefix piezo is derived from the Greek word 
piezein, “to press”. The piezoelectric effect was discovered in 1880 by the brothers Jacques and Pierre 
Curie. Not only did they demonstrate the phenomenon, but they also established the criteria for its 
existence in a given crystal. Of the thirty-two crystal classes, twenty-one are non-centrosymmetric (not 
having a centre of symmetry), and of these, twenty exhibit direct piezoelectricity. 


The first practical application of the piezoelectric effect was developed when ground quartz crystals were 
placed between the plates of a tuning capacitor in order to stabilize oscillating circuits in radio transmitters 
and receivers; however, the phenomenon of piezoelectricity was not well exploited until World War I, 
when Langevin used piezoelectrically excited quartz plates to generate sounds waves in water for use in 
submarine detection. 


Piezoelectricity can also occur in polycrystalline or amorphous substances which have become anisotropic 
by external agents. Synthetic piezoelectric materials became available near the end of World War II, with 
the accidental discovery of the fact that materials like barium titanate and rare earth oxides become 
piezoelectric when they are polarized electrically. During the postwar years, when germanium and silicon 
were revolutionizing the electronics industry, piezoceramics appeared for a while to be joining the 
revolution, but the limited availability of materials and components, made the piezoelectric phenomenon 
failed to lead mature applications during the 1950s. It is only now that a variety of piezoelectric materials 
are being synthesized and optimized. As a consequence piezoelectric-based devices are undergoing a 
revolutionary development, specially for medicine and aerospace applications. 


Piezoelectric ceramics 


Most piezoelectric transducers are made up of ceramic materials for a broad range of electromechanical 
conversion tasks as transmitters, ranging from buzzers in alarm clocks to sonars, and as receivers, ranging 
from ultra high frequency (UHF) filters to hydrophones. 


Most of the piezoelectric materials in usage are from the lead zirconate titanate (PZT) family, because of 
their excellent piezoelectric parameters, thermal stability, and dielectric properties. Additionally the 
properties of this family can be modified by changing the zirconium to titanium ratio or by addition of both 
metallic and non-metallic elements. PZT (PbZr,_,TixO3) ceramics and their solid solutions with several 
complex perovskite oxides have been studied; among the various complex oxide materials, niobates have 
attracted special attention. Ternary ceramic materials, lead metaniobate, as well as, barium and modified 
lead titanates complete the list of piezoceramic materials. 


Selective parameters for piezoceramic materials are given in [link], where Q,, is the mechanical quality 
factor, T, is the Curie point, d3; is the the transverse charge coefficient, and kp, k;, and k3, are the 
electromechanical coupling factors for planar, thickness, and transversal mode respectively. 


Material PZT Lead PSZNT PZT, PSN- TsTS- PZT, 


property modified metaniobate 31/40/29 x= PLT 42-1 x= 
0.5 50/50 0.48 

Qn 350 40 222 74 41 887 

T. (°C) 290 462 369 152 355 

CN 50 

kp 0.5 60 0.428 30.7 46.5 

k 0.32 0.438 - 

k31 0.21 0.263 17.9 


Selective parameters for illustrative piezoceramic materials. 


Recently, sol-gel processing has been used to prepare ceramics, making possible the preparation of 
materials that are difficult to obtain by conventional methods. Both, inorganic and organic precursor have 
been reported. Additionally, new techniques for the production of ceramic fibers have been developed. 
Better processing and geometrical and microestructural control are the main goals in the production of 
fibers. 


The latest development in piezoceramic fibers is the modification of the viscous-suspension-spinning 
process (VSSP) for the production of continuos piezoelectric ceramic fibers for smart materials and active 
control devices, such as transducers, sensor/actuators and structural-control devices. The VSSP utilizes 
conventional synthesized ceramic powders and cellulose, as the fugitive carrier, to produce green ceramic 
fiber at a reasonable cost. [link] shows the schematic representation of the VSSP. 
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The viscous-suspension-spinning process (VSSP) for the production 
of continuous piezoceramic fiber. 


Synthesis of reactive PZT precursor powder by the oxalate coprecipitation technique has also been 
developed. The precursor transforms to phase pure PZT at or above 850 °C the PZT obtained by this 
technique showed a Curie temperature of 355 °C. The advantages of the coprecipitation technique are the 
lack of moisture sensitive and special handling precursors. 


Although new materials have been investigated with the purpose of create replacements for ceramics, there 
has been a great improvement in their properties and, current research is focused in the development of 


new techniques for both synthesis and processing. 


Piezoelectric single crystals. 


The recent progress of the electronic technology requires new piezoelectric crystals with a high thermal 
stability and large electromechanical coupling factors. Single-crystal materials have been considered as 
replacements for polycrystalline ceramics. Ideally single-crystals of lead zirconate titanate (PZT) itself 
would be the main choice as it is the most prevailing piezoelectric material, but it is difficult to grow large 
single crystals. On the other hand, the fact that single-crystals offer many advantages over polycrystalline 
systems has been recognized. Materials such as lithium niobate present essentially no aging, no mechanical 
creep and excellent performance in high temperature conditions. 


New piezoelectric single crystals grown by conventional RF-heating Czochralski (CZ) technique have 
been synthesized. High purity starting materials, mainly oxides powders, and Ar atmosphere are required. 
La3GasSiOj4, Lag3Nbg 5Gas.5O14 and La3Tag.5Gas.5O1,4 single crystals have been grown by using this 
method. However, the CZ technique can be applied only to materials that can be synthesized by ordinary 
solid-state reaction and can undergo the pulling method. 


BaBe)Si,O7 (barylite) has been known as material with a strong piezoelectricity, however, it can not be 
obtained by solid-state reaction and CZ technique therefore is not applicable. As an alternative for 
piezoelectric crystals growth hydrothermal synthesis has been developed. [link] shows the experimental 
apparatus for the growth of barylite. Eventhough, crystals can be obtained using this technique, high 
pressure (500 - 1000 bar) and a solvent for the raw materials are required. 


Experimental 
apparatus for the 
hydrothermal 
synthesis of 
barylite. H = heater, 
F = furnace, S = 
specimen vessel, G 
= growth capsule, P 
= pressure gauge, 
and T = 
thermocouples. 


Adapted from M. 
Maeda, T. Uehara, 
H. Sato and T. 
Ikeda, Jpn. J. Appl. 
Phys., 1991, 30, 
2240. 


While the piezoceramics dominate the single crystal materials in usage, single crystals piezoelectrics 
continue to make important contributions both in price-conscious consumer market and in performance- 
driven defense applications. Areas such as frequency stabilized oscillators, surface acoustic wave devices 
and filters with a wide pass band, are still dominated by single crystals. 


Piezoelectric thin films 


Recently, there has been great interest in the deposition of piezoelectric thin films, mainly for 
microelectronical systems (MEMS) applications; where the goal is to integrate sensors and actuators based 
on PZT films with Si semiconductor-based signal processing; and for surface acoustic wave (SAW) 
devices; where the goal is to achieve higher electromechanical coupling coefficient and temperature 
stability. Piezoelectrical microcantilevers, microactuators, resonators and SAW devices using thin films 
have been reported. 


Several methods have been investigated for PZT thin films. In the metallo-organic thin film deposition, 
alkoxides are stirred during long periods of time (up to 18 hours). After pyrolisis, PZT amorphous films are 
formed and then calcination between 400 — 600 °C for 80 hours leads to PZT crystallization (perovskita 
phase) by a consecutive phase transformation process, which involves a transitional pyrochlore phase. 


A hybrid metallorganic decomposition (MOD) route has also been developed to prepare PZT thin films. 
Lead and titanium acetates and, zirconium acetylacetonate are used. The ferroelectric piezoelectric and 
dielectric properties indicate that the MOD route provides PZT films of good quality and comparable to 
literature values. In addition to being simple, MOD has several advantages which include: homogeneity at 
molecular level and ease composition control. 


Metalorganic chemical vapor deposition (MOCVD) has been applied to PZT thin films deposition also. It 
has been proved that excellent quality PZT films can be grown by using MOCVD, but just recently the 
control of microstructure the deposition by varying the temperature, Zr to Ti ratio and precursors flow has 
been studied. Recent progress in PZT films deposition has led to lower temperature growth and it is 
expected that by lowering the deposition temperature better electrical properties can be achieved. 
Additionally, novel techniques such as KrF excimer laser ablation and, ion and photo-assisted depositions, 
have also been used for PZT films synthesis. 


On the other hand, a single process to deposit PZT thin film by a hydrothermal method has been reported 
recently. Since the sol-gel method, sputtering and chemical vapor deposition techniques are useful only for 
making flat materials, the hydrothermal method offers the advantage of making curved shaped materials. 
The hydrothermal method utilizes the chemical reaction between titanium and ions melted in solution. A 
PZT thin film has been successfully deposited directly on a titanium substrate and the optimum ion ratio in 
the solution is being investigated to improve the piezoelectric effect. 


Among the current reported piezoelectric materials, the Pb(Ni,/3Nb2/3)9.2Zro,4Tig. 403 (PNNZT, 2/4/4) 
ferroelectric ceramic has piezoelectric properties that are about 60 and 3 times larger than the reported 
values for ZnO and PZT. A sol-gel technique has been developed for the deposition of a novel piezoelectric 


PNNZT thin film. A 2-methoxyethanol based process is used. In this process precursors are heated at lower 
temperature than the boiling point of the solvent, to distill off water. Then prior high temperature 
annealing, addition of excess Pb precursor in the precursor solution is required to compensate the lead loss. 
The pure perovskite phase is then obtained at 600 °C, after annealing. 


Thin films of zinc oxide (ZnO), a piezoelectric material and n-type wide-bandgap semiconductor, have 
been deposited. ZnO films are currently used in SAW devices and in electro-optic modulators. ZnO thin 
films have been grown by chemical vapor deposition and both d.c. and r.f. sputtering techniques. Recently, 
optimization of ZnO films by r.f. magnetron sputtering has been developed. However, homogeneity is one 
of the main problems when using this technique, since films grown by this optimized method, showed two 
regions with different piezoelectric properties. 


DC magnetron sputtering is other technique for piezoelectric thin film growth, recently aluminum nitride, a 
promising material for use in thin-film bulk acoustic wave resonators for applications in RF bandpass 
filters, has been grown by this method. The best quality films are obtained on Si substrates. In order to 
achieve the highest resonator coupling, the AIN must be grown directly on the electrodes. The main 
problem in the AIN growth is the oxygen contamination, which leads to the formation of native oxide on 
the Al surface, preventing crystalline growth of AIN. 


Piezoelectric polymers 


The discovery of piezoelectricity in polymeric materials such as polyvinylidene difluoride (PVF), was 
considered as an indication of a renaissance in piezoelectricity. Intensive research was focused in the 
synthesis and functionalization of polymers. A potential piezoelectric polymer has to contain a high 
concentration of dipoles and also be mechanically strong and film-forming. The degree of crystallinity and 
the morphology of the crystalline material have profound effects on the mechanical behavior of polymers. 
Additionally, in order to induce a piezoelectric response in amorphous systems the polymer is poled by 
application of a strong electric field at elevated temperature sufficient to allow mobility of the molecular 
dipoles in the polymer. Recent approaches have been focused in the development of cyano-containing 
polymers, due to the fact that cyano polymers could have many dipoles which can be aligned in the same 
direction. 


Phase transfer catalyzed reaction has been used for piezoelectric polymer preparation from malonitrile, 
however this method leads to low molecular weight, and low yield of impure vinylidene cyanide units 
containing material. The use of solid K>CO3 and acetonitrile without added phase transfer catalyst shows 
excellent yields for polyester possessing backbone gem-dinitriles and for polyamide synthesis. The 
polyester and polyamide obtained contained a dinitrile group net dipole which can be align in the same 
direction as the carbonyl groups. 


The pursuit for better piezoelectric polymers has led to molecular modeling which indicates that one cyano 
substituent should be almost as effective as two geminal cyano substituents, opening a new area of 
potential materials having an acrylonitrile group as the basic building block. However, polyacrilonitrile 
itself is not suitable because it forms a helix. Thus acrylonitrile copolymers have been investigated. 


Most of the piezoelectric polymers available are still synthesized by conventional methods such as 
polycondensation and radical polymerization. Therefore piezoelectric polymer synthesis has the same 
problems as the commercial polymer preparation, such as controlling the degree of polymerization and 
crystallinity. 


A novel technique of vapor deposition polymerization has been reported as an alternative method to 
copolymeric thin films. Aliphatic polyurea 9 was synthesized by evaporating monomers of 1,9- 
diaminononano and 1,9-diisocyanatononano onto glass substrate in vacuum. Deposition rates were 
improved at temperatures below 0 °C. After poling treatment films showed fairly large piezoelectric 


activities. Additionally, a completely novel approach to piezoelectric polymers has been presented. This 
approach, consists in the synthesis of ordered piezoelectric polymer networks via crosslinking of liquid- 
crystalline monomers. The main goal in this approach is to achieve a polymer network which combines the 
long term stability of piezoelectric single-crystals with the ease of processability and fabrication of 
conventional polymers. [link] shows the schematic representation of this approach. 
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Scheme of a ordered piezoelectric networks via a liquid-crystalline 
monomer strategy. Adapted from D. L. Gin and B. C. Baxter, 
Polymer Preprints, 1996, 38, 211. 


Piezoelectric polymers are becoming increasingly important commercially because of their easier 
processability, lower cost, and higher impact resistance than ceramics, but the lack of high temperature 
stability and the absence of a solid understanding of the molecular level basis for the electrical properties 
are limitations. The requirements for strong piezoelectricity in a polymer are: the polymer chain has a 
larger resultant dipole moment normal to the chain axis; polymer crystallizes into a polar crystal with the 
polar axis perpendicular to the chain axis, has a high crystallinity and finally the polymer polar axis aligns 
easily in the thickness direction during poling. 


Piezoelectric composites 


Piezocomposites have been obtained by the combination of piezoelectric ceramics and polymers, the 
resulting material posses both the high piezoelectric properties of ceramics and the processability of 
polymers. 1-3 type piezocomposites have found wide applications as medical and industrial ultrasonic 
transducers. 


The current method for piezocomposite production is the dice-and-fill technique, which consists in cutting 
two sets of grooves in a block of piezoceramic at right angles each other, then a polymer is cast into these 
grooves and the solid ceramic base is ground off. Polishing and poling are the following steps in order to 
achieve the final thickness and properties. This method is expensive, time consuming and size limited. 


As an alternative for the dice-and-fill technique, continuos green fibers obtained by the modified viscous- 
suspension-spinning process, can be bundled into a cottonball-like shape, then burned and sintered. The 
sintered bundle impregnated with epoxy resin can be sliced into discs and then polarized. Recent results 
have yielded 1-3 type composites with excellent piezoelectric properties. 


On the other hand, an innovative process has been developed for Srj(Nbp.5Tag.5)207/PVDF composites, in 
this new fabrication method, appropriate amounts of oxides are mixed, pressed and sintered. The porous 
resulting material is subsequently infiltrated with PVDF solution and then poled. This new method for 
composites preparation is simple and offers a lead-free alternative smart material. 


Another kind of piezocomposites can be achieved by spinning films of piezoceramic onto metal alloys, 
such as TiNi. The resulting materials is a hybrid composite that can utilize the different active and adaptive 


properties of the individual bulk materials. Due to the shape memory nature of TiNi, a possible application 
for this new heterostructures could be smart active damping of mechanical vibrations. DC sputtering and 
spin coating are the techniques necessary for the smart thin film TiNi/piezoelectric heterostructures 
fabrication. However, eventhough the films had a fine grain structure and high mechanical qualities, the 
ferroelectric properties were poor compared to literature values. 


In the future, the properties of piezocomposites will be tailored, by varying the ceramic, the polymer and 
their relative proportions. Adjustments in the material properties will lead to fulfillment of the 
requirements for a particular device. [link] shows a comparison among piezoelectric ceramics, polymers 
and composites parameters where Z is the impedance, €'33 is the dielectrical constant, and p is the density. 


Material parameter Piezoceramics Piezopolymers Piezocomposites 
k, (%) 45-55 20 - 30 60 - 75 

Z (10° Rayls) 20 - 30 15-4 4-20 

€'33/€9 200 - 5000 ~10 50 - 2500 

tan y (%) <1 15-5 <1 

Qn 10 - 1000 5-10 2-50 

p (10° kg/m?) 5.5-8 1-2 2-5 


Parameter ranges for piezoelectric ceramics, polymers and composites. 


Piezoelectric coatings. 


Many potential applications exist which require film thickness of 1 to 30 ym. Some examples of these 
macroscopic devices include ultrasonic high frequency transducers, fiber optic modulators and for self 
controlled vibrational damping systems. 


ZnO and PZT have been used for piezoelectric fiber optic phase modulators fabrication. The piezoelectric 
materials have been sputter deposited using dc magnetron source and multimagnetron sputtering systems. 
Coatings of 6 ym thick of ZnO and 0.5 jm of PZT are possible to achieve using these systems. However, 
thickness variation of approximately 15% occurs between the center and the end of ZnO coatings, results 
on affected modulation performance. Although PZT coatings achieved by sputtering posses uniformity and 
do not exhibit cracking, the PZT is only partially crystallized and it is actually a composite structure 
consisting of crystalline and amorphous material, diminishing the piezoelectric properties. 


Sol-gel technique for thick PZT films have been developed. It is now possible to fabricate PZT sol-gel 
films of up to 60 pm. The electrical and piezoelectrical properties of the thick films reported are 
comparable with ceramic PZT. 


Piezoelectric polymer coatings for high-frequency fiber-optic modulators have been also investigated. 
Commercial vinylidene fluoride and tetrafluoroethylene copolymer has been used. The advantage of using 
polymer coatings is that the polymer jacket (coating) can be easily obtained by melt extrusion on a single- 


mode fiber. Thus, uniformity is easily achieved and surface roughness is not present. Furthermore, if 
annealing of the polymer is made prior poling, a high degree of crystallinity is enhanced, leading to better 
piezoelectric properties. 
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Formation of Silicon and Gallium Arsenide Wafers 

Integrated circuits (ICs) and discrete solid state devices are manufactured on semiconductor 
wafers. The following focuses on the general principles and methods with regard to wafer 
formation. 


Introduction 


Integrated circuits (ICs) and discrete solid state devices are manufactured on semiconductor 
wafers. Silicon based devices are made on silicon wafers, while III-V (13-15) semiconductor 
devices are generally fabricated on GaAs wafers, however, for certain optoelectronic 
applications InP wafers are also used. The electrical and chemical properties of the wafer 
surface must be well controlled and therefore the preparation of starting wafers is a crucial 
portion of IC and device manufacturing. In order to obtain high fabrication yields and good 
device performance, it is very important that the starting wafers be of reproducibly high 
quality. For example, the front surface must be smooth and flat on both a macro- and 
microscale, because high-resolution patterns (lithography) are optically formed on the wafer. 
In principle, cutting a crystal into thin slices and polishing one side until all saw marks are 
removed and the surface appears smooth and glossy could produce a suitable wafer. 
However, due in part to the brittleness of Si and GaAs crystals, as well as the increasing 
requirements of wafer cleanliness and surface defect reduction with ever decreasing device 
geometries, a very complex series of processing steps are required to produce analytically 
clean, flat and damage-free wafer surfaces. 


The following focuses on the general principles and methods with regard to wafer formation. 
Detailed formulas, recipes, and specific process parameters are not given as they vary 
considerably among different wafer producers. However, in general, techniques for 
fabrication of Si wafers have generally become standardized within the semiconductor 
industry. In contrast, GaAs wafer technology is less standardized, possibly due to either (a) 
the similarity to silicon practices or (b) the lower production volume of GaAs wafers. There 
are two general classes of processes in the methodology of making wafers: mechanical and 
chemical. As both Si and GaAs are brittle materials, the mechanical processes for their wafer 
fabrication are similar. However, the different chemistry of Si and GaAs require that the 
chemical processes be dealt with separately. 


Wafer formation procedures 


Each of the processing steps in the conversion of a semiconductor ingot (formed by 
Czochralski or Bridgeman growth) into a polished wafer ready for device fabrication, results 
in the removal of material from the original ingot; between !/3 and '/, of the original ingot is 
sacrificed during processing. Methods for the removal of material from a crystal ingot are 
classified depending on the size of the particles being removed during the process. If the 
removed particles are much larger than atomic or molecular dimensions the process is 
described as being macro-scale. Conversely, if the material is removed atom-by-atom or 
molecule-by-molecule then the process is termed micro-scale. A further distinction between 
various types of processes is whether the removal occurs as a result of mechanical or 


chemical processes. The formation of a finished wafer from a semiconductor ingot normally 
requires six machining (mechanical) operations, two chemical operations, and at least one 
polishing (chemical-mechanical) operation. Additionally, multiple inspection and evaluation 
steps are included in the overall process. A summary of the individual steps, and their 
functions, involved in wafer production is shown in [link]. 


Process Type Function 
: : removal of conical shaped ends and impure 
cropping mechanical ; 
portions 
grinding mechanical obtain precise diameter 
orientation P identification of crystal orientation and 
; mechanical 

flatting dopant type 
etching chemical removal of surface damage 
wafering mechanical formation of individual wafers by cutting 
heat treatment thermal annihilation of undesirable electronic donors 
edge : : : 

: mechanical provide radius on the edge of the wafer 
contouring 
lapping mechanical provides requisite flatness of the wafer 
etching chemical removal of surface damage 
polishing pieeMene provides a smooth (specular) surface 

chemical 

cleaning ee | removal of organics, heavy metals, and 


particulates 


Summary of the process steps involved in semiconductor wafer production. 


Crystal shaping 


Although an as-grown crystal ingot is of high purity (99.9999%) and crystallinity, it does not 
have the sufficiently precise shape required for ready wafer formation. Thus, prior to slicing 
an ingot into individual wafers, several steps are needed. These operations required to prepare 
the crystal for slicing are referred to as crystal shaping, and are shown in [link]. 


(b) (© @ (b) 


Schematic representation of crystal shaping operations: 
(a) remove crown and taper, (b) grind to required 
diameter, (c) grind flat, and (d) slice sample for 
measurements. Shaded area represents material 
removed. 


Cropping 


The as-grown ingots have conical shaped seed (top) and tang (bottom) ends that are removed 
using a circular diamond saw for ease of further manipulation of the ingot ((link]a). The 
cuttings are sufficiently pure that they are cleaned and the recycled in the crystal growth 
operation. Portions of the ingot that fail to meet specifications of resistivity are also removed. 
In the case of silicon ingots these sections may be sold as metallurgical-grade silicon (MGS). 
Conversely, portions of the crystal that meet desired resistivity specifications may be 
preferentially selected. A sample slice is also cut to enable oxygen and carbon content to be 
determined; usually this is accomplished by Fourier transform infrared spectroscopic 
measurements (FT-IR). Finally, cropping is used to cut crystals to a suitable length to fit the 
saw Capacity. 


Grinding 


The primary purpose of crystal grinding is to obtain wafers of precise diameter because the 
automatic diameter control systems on crystal growth equipment are not capable of meeting 
the tight wafer diameter specifications. In addition, crystals are seldom grown perfectly round 
in cross section. Thus, ingots are usually grown with a 1 - 2 mm allowance and reduced to the 
proper diameter by grinding [link]b. 


Crystal grinding is a straightforward process using an abrasive grinding wheel, however, it 
must be well controlled in order to avoid problems in subsequent operations. Exit chipping in 
wafering and lattice slip in thermal processing are problems often resulting from improper 
crystal grinding. Two methods are used for crystal grinding: (a) grinding on center and (b) 
centerless grinding. 


[link] shows a schematic of the general set-up for grinding a crystal ingot on center. The 
crystal is supported at each end in a lathe-like machine. The rotating cutting tool, employing 
a water-based coolant, makes multiple passes down the rotating ingot until the requisite 
diameter is obtained. The center grinder can also be used for grinding the identification flats 
as well as providing a uniform ingot diameter. However, grinding the crystal on centers 
requires that the operator locate the crystal axis in order to obtain the best yield. 
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movement 


diamond cup 
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Schematic representation of 
grinding on center. 


Centerless grinding eliminates the problems associated with locating the crystal center. The 
centerless method is superior for long crystals; however, a centerless grinder is much larger 
than a center grinder of the same diameter capacity. In centerless grinding the ingot is 
supported between two wheels, a grinding wheel and a drive wheel. A schematic of the 
centerless grinder is shown in [link]. The axis of the drive wheel is canted with respect to that 
of the crystal ingot and the grinding wheel pushing the crystal ingot past the stationary (but 
rotating) grinding wheel, see [link]b. 


grinding ; drivewheel 
wheel drive wheel grinding 
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Schematic representation of centerless grinding viewed 
(a) along and (b) perpendicular to the crystal axis. 


Orientation/identification flats 


Following grinding of the ingot to the desired diameter, one or two flats are ground along the 
length of the ingot. The identification flats (one or two) are ground lengthwise along the 
crystal according to the orientation and the dopant type. After grinding the crystal on centers 
the crystal is rotated to the proper orientation, then the wheel is positioned with its axis of 
rotation perpendicular to the crystal axis and moved along the crystal from end to end until 
the appropriate flat size is obtained. An optical or X-ray orientation fixture may be used in 
conjunction with the crystal mounting to facilitate the proper orientation of the crystal on the 
grinder. 


The largest flat is called the primary flat ({link]c) and is parallel to one of the crystal planes, 
as determined by X-ray diffraction. The primary flat is used for automated positioning of the 
wafer during subsequent processing steps, e.g., lithographic patterning and dicing. Other 
smaller flats are called "secondary flats" and are used to identify the crystal orientation 
(<111> versus <100>) and the material (n-type versus p-type). Secondary flats provide a 
quick and easy manner by which unknown wafers can be sorted. The flats shown 
schematically in [link] are located according to a Semiconductor Equipment and Materials 
Institute (SEMI®) standard and are ground to specific widths, depending upon crystals 
diameter. Notches are also used in place of the secondary flat; however, the relative 
orientations of the notch and primary flat with regard to crystal orientation and dopant are 
maintained. 
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SEMI locations for orientation/identification flats. 


Etching 


The cropping and grinding processes are performed with relatively coarse abrasive and 
consequently a great deal of subsurface damage results. Pits, chips, and cracks all contribute 
to stress in the cut wafer and provide nuclei for crack propagation at the edges of the finished 
wafer. If regions of stress are removed then cracks will no longer propagate, reducing exit 
chipping and wafer breakage during subsequent fabrication steps. 


The general method for removing surface damage is to etch the crystal in a hot solution. The 
most common etchants for Si are based on the HNO3-HF system, in which etchant modifiers 
such as acetic acid also commonly used. In the case of GaAs HCI-HNO3 is the appropriate 
system. These etchants selectively attack the crystal at the damaged regions. After etching, 
the crystal is transferred to the slicing preparation area. 


Wafering 


The purpose of wafering is to saw the crystal into thin slices with precise geometric 
dimensions. By far, the most common method of wafering semiconductor crystals is the use 
of an annular, or inner diameter (ID), diamond saw blade. A schematic diagram of ID slicing 
technology is shown in [link]. 
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Schematic diagrams of ID slicing 
process. 


The crystal, when it arrives at the sawing area, has been ground to diameter, flatted, and 
etched. In order to slice it, the crystal must be firmly mounted in such a way that it can be 
completely converted to wafers with minimum waste. The crystal is attached with wax or 
epoxy to a mounting block, which is usually cylindrical in shape and of the same diameter as 
the ingot. Also, a mounting beam (or strip) is attached along the length of the crystal at the 
breakout point of the saw blade. This reduces exit chipping (breakage that occurs as the blade 
exits the crystal at the end of a cut) and also provides support for the sawn wafer until it is 
retrieved. Graphite or phenolic resins are common materials for the mounting block and 
beams, although some success has been obtained in mounting ingots using hydraulic 
pressure. The saw blade is a thin sheet of stainless steel (325 um), with diamond bonded to its 
inner edge. This blade is mounted on a drum that rotates at ca. 2000 rpm. Saw blades 58 cm 
(23 inches) in diameter with a 20 cm (8 inches) opening are common, however, as wafer 
sizes increase larger blades are employed: 30 cm (12 inches) wafers are now common for Si. 
The blade moves relative to the stationary crystal at a speed of 0.05 cm/s, and the cutting 
process is water-cooled. Thus, considering that wafers are sliced sequentially (one at a time), 
the overall process is very slow. A further problem is that the kerf loss (loss due to the width 
of the blade) results in approximately 1/3 of the material being lost as saw dust. Finally, the 
depth of the drum onto which the blade is attached limits the length of the ingot section that 
is accessible. In order to overcome this problem, another style of ID blade saw was developed 
in which the blade is mounted on an air bearing and is rotated by a belt drive. This allows the 
entire length of the crystal ingot to be sliced. 


Both silicon and GaAs crystals are grown with either the crystallographic <100> or <111> 
direction parallel to the cylindrical axis of the crystal. Wafers may be cut either exactly 
perpendicular to the crystallographic axis or deliberately off-axis by several degrees. In order 
to obtain the proper wafer orientation, the crystal must be properly oriented on the saw. All 
production slicing machines have adjustments for orientation of the crystal; however, it is 


usually necessary to check the orientation of the first slice in order to assure that all 
subsequent slices will be properly oriented. 


Obvious variables introduced during the wafering process include: cutting rate, wheel speed, 
and coolant flow rate. However, the condition of the machines, such as alignment and 
vibration, is the most important variable followed by the condition of the blade. A deviated 
blade rim may cause taper, bow, or warp. [link] summarizes the types of deformations that 
can occur during wafering, their physical appearance and their characteristics. 
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Deformed wafers and their characteristics. 


Heat treatment 


As-produced Czochralski grown crystals often have a level of oxygen impurity that may 
exceed the concentration of dopant in the semiconductor material (i.e., Si or GaAs). This 
oxygen impurity has a deleterious effect on the semiconductor properties, especially upon 
subsequent thermal processing, e.g., thermal oxide growth or epitaxial film growth by metal 
organic chemical vapor deposition (MOCVD). For example, when silicon crystals are heated 


to about 450 °C the oxygen undergoes a transformation that causes it to behave as an electron 
donor, much like an n-type dopant. These oxygen donors, or "thermal donors", mask the true 
resistivity of the semiconductor because they either add additional carrier electrons to a n- 
type crystal or compensate for the positive holes in a p-type crystal. Fortunately, these 
thermal donors can be "annihilated" by heat treating the materials briefly in the range of 500 - 
800 °C and then cooling quickly through the 450 °C region before donors can reform. In 
principle thermal donor annihilation can be performed on wafers at any time during their 
fabrication; however, it is usually best to perform the heat treatment immediately after 
wafering since sub-standard wafers may be rejected before additional processing steps are 
undertaken and thus limiting additional cost. Donor annihilation is a bulk effect, and 
therefore the thermal treatment can be performed in air, since any surface oxide that may 
form will be removed in subsequent lapping and polishing steps. 


Lapping or grinding 


The as-cut wafers vary sufficiently in thickness to require an additional operation, the slicing 
operation does not consistently produce the required flatness and parallelism required for 
many wafer specifications, see [link]. Since conventional polishing does not correct 
variations in flatness or thickness, a mechanical two-sided lapping operation is performed. 
Lapping is capable of achieving very precise thickness uniformity, flatness and parallelism. 
Lapping also prepares the surface for polishing by removing the sub-surface sawing damage, 
replacing it with a more uniform and smaller lapping damage. 


The process used for lapping semiconductor wafers evolved from the optical lens 
manufacturing industry using principles developed over several hundred years. However, as 
the lens has a curved surface and the wafers are flat, the equipment for lapping wafers is 
mechanically simpler than lens processing machines. The simplest double-side lapping 
machine consists of two very flat counter-rotating plates, carriers to hold and move the 
wafers between the plates, and a device to feed abrasive slurry steadily between the plates. 
The abrasive is typically a 9 pm Al,O3 grit. Commercial abrasives are suspended in water or 
glycerin with proprietary additives to assist in suspension and dispersion of the particles, to 
improve the flow properties of the slurry, and to prevent corrosion of the lapping machine. 
Hydraulics or an air cylinder applies lapping pressure with low starting pressure for 2 to 5 
minutes, which is then increased through most of the process. The completion of lapping may 
be determined by elapsed time or by an external thickness sensing device. The finished 
process gives a wafer with a surface uniform to within 2 pm. Approximately 20 ym per side 
is removed during the lapping process. 


Although lapping would appear to be simple in concept, the successful implementation of a 
production lapping operation requires the development of a technique and experience to 
achieve acceptable quality with good yields. Small adjustments to the rotation rates of the 
plates and carriers will cause the plates to wear concave, convex or flat. 


As lapping is a messy process, various efforts have been made to avoid it or to substitute an 
alternative process. The most likely approach at present is grinding, in which the wafer is 
held on a vacuum chuck and a series of progressively finer diamond wheels is moved over 


the wafer while it is rotated on a turn table. Grinding gives a clearer surface than lapping, 
however, only one side may be ground at a time and the resulting flatness is not as good as 
that obtained by lapping. 


Edge contouring 


The rounding of the edge of the wafer to a specific contour is a fairly recent development in 
the technology of wafer preparation. It was known by the early seventies that a significant 
number of device yield problems could be traced to the physical condition of the wafer edge. 
An acute edge affects the strength of the wafer due to: stress concentration, and a lowering of 
its resistance to thermal stress, as well as being the source of particle chip, breakage, and 
lattice damage. In addition, the particles originating from the chipped edges can, if present on 
the wafer surface, add to the defect density (Do) of the IC process reducing fabrication yield. 
Further problems associated with a square edge include the build-up of photoresist at the 
wafer edge. The solution to these process problems is to provide a contoured edge with a 
defined radius (r). 


Chemical etching of wafers results in a degree of edge rounding, but it is difficult to control. 
Thus, mechanical edge contouring has been developed and the result has been a dramatic 
improvement in yields in downstream wafer processing. Losses due to wafer breakage are 
also reduced. The edge contouring process is usually performed in cassette-fed high speed 
equipment, in which each wafer is rotated rapidly against a shaped cutting tool ({link]). 
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Schematic illustration of edge contouring. 


Etching 


The mechanical processes described above to shape the wafer leave the surface and edges 
damaged and contaminated. The depth of the work damage depends on the specific process, 
however, 10 um is typical. Such damage is readily removed by chemical etching. Etching is 
used at multiple points during the fabrication of a semiconductor device. The discussion 
below is limited to etches suitable for wafer fabrication, i.e., non-selective etching of the 
entire wafer surface. 


Wet chemical etching 


The wet chemical etching of any material can be considered to involve three steps: (a) 
transportation of the reactants to the surface, (b) reaction at the surface, and (c) movement of 
the reaction products into the etchant solution ({link]). Each of these may be the rate limiting 
step and thus control the etch rate and uniformity. This effect is summarized in [link]. 
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Schematic representation of the three steps 
involved in wet chemical etching: (i) diffusion of 
the chemical etch reagents through the boundary 

layer, (ii) chemical reaction at the surface, and (iii) 
diffusion of the reaction products into the etch 
solution through the boundary layer. 
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Effects of rate limiting step in semiconductor etching. 


An etchant that is limited by the rate of reaction at the surface will tend to enhance any 
surface features and promote surface roughness due to preferential etching at defects 
(anisotropic). In contrast, if the etch rate is limited by the diffusion of the etchant reagent 
through a stagnant (dead) boundary layer near the surface, then the etch will result in uniform 
polishing and the surface will become smooth (isotropic). If removal of the reaction products 
is rate limiting then the etch rate will be slow because the etch equilibrium will be shifted 
towards the reactants. In the case of an individual etchant reaction, the rate determining step 
may be changed by rapid stirring to aid removal of reaction products, or by increasing the 
temperature of the etch solution, see [link]. The exact etching conditions are chosen 
depending on the application. For example, dilute high temperature etches are often 
employed where the etch damage must be minimized, while cooled etches can be used where 
precise etch control is required. 
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Typical etch rate versus temperature plot for 
a mixture of HF (20%), nitric acid (45%), 
and acetic acid (35%). 


Traditionally mixtures of hydrofluoric acid (HF), nitric acid (HNO3) and acetic acid 
(MeCO>H) have been used for silicon, but alkaline etches using potassium hydroxide (KOH) 
or sodium hydroxide (NaOH) solutions are increasingly common. Similarly, gallium arsenide 
etches may be either acidic or basic, however, in both cases the etches are oxidative due to 
the use of hydrogen peroxide. A wide range of chemical reagents are commercially available 
in "transistor grade" purity and these are employed to minimize contamination of the 


semiconductor. Deionized water is commonly used as a diluent for each of these reagents and 
the concentration of commonly used aqueous reagents is given in [link]. 


Reagent Weight % Reagent Weight % 
HCl 37 HF 49 

H SO, 98 H3PO4 85 

HNO; 79 HClO, 70 
MeCO,H 99 H 05 30 
NH,OH 29 


Weight percent concentration of commonly used concentrated aqueous reagents. 


The equipment used for a typical etchant process includes an acid (or alkaline) resistant tank, 
which contains the etchant solution and one or more positions for rinsing the wafers with 
deionized water. The process is batch in nature involving tens of wafers and the best 
equipment provides a means of rotating the wafers during the etch step to maintain 
uniformity. In order to assure the removal of all surface damage, substantial over-etching is 
performed. Thus, the removal of 20 1m from each side of the wafer is typical. Etch times are 
usually several minutes per batch. 


Etching silicon 


The most commonly used etchants for silicon are mixtures of hydrofluoric acid (HF) and 
nitric acid (HNO3) in water or acetic acid (MeCO>H). The etching involves a reduction- 
oxidation (redox) reaction, followed by dissolution of the reaction products. In the HF-HNO3 
system the HNO3 oxidizes the silicon and the HF removes the reaction products from the 
surface. The overall reaction is: 

Equation: 


Si + HNO, + 6 HF > H,SiF, + HNO, + H,O 


The oxidation reaction involves the oxidation of Si to Si**, and it is auto-catalytic in that the 
reaction product promotes the reaction itself. The initial step involves trace impurities of 


HNO, in the HNO; solution, [link], which react to liberate nitrogen dioxide (NO>), [link]. 
Equation: 


HNO, + HNO, > NO, +H;0 
Equation: 


N,O, > 2.NO, 


The nitrogen dioxide oxidizes the silicon surface in the presence of water, resulting in the 
formation of Si(OH)» and the reformation of HNOb, [link]. The Si(OH) decomposes to give 
SiO», [link]. Since the reaction between HNO» and HNOs, [link], is rate limiting, an 
induction period is observed. However, this is overcome by the addition of NO. ions in the 
form of [NH,][NO>]. 

Equation: 


Si’ +2 NO, +2 H,O > Si(OH), + 2 HNO, 
Equation: 


Si(OH), > SiO, + H, 


The final step of the etch process is the dissolution of the SiO» by HF, [link]. Stirring serves 
to remove the soluble products from the reaction surface. The role of the HF is to act as a 
complexing reagent, and thus the reaction shown in [link] is known as a complexing reaction. 
The formation of water as a reaction product requires that acetic acid be used as a diluent 
(solvent) to ensure better control. 

Equation: 


SiO, +6 HF > H,SiF, + H,O 


The etching reaction is highly dependent on the relative ratios of the etchant reagents. Thus, 
if an HF-rich solution is used, the reaction is limited by the oxidation step, [link], and the 
etching is anisotropic, since the oxidation reaction is sensitive to doping, crystal orientation, 
and defects. In contrast, the use of a HNO3-rich solution produces isotropic etching since the 
dissolution process is rate limiting ({link]). The reaction of HNO3-rich solutions has been 
found to be diffusion-controlled over the temperature range 20 - 50 °C ([link]), and is 
therefore commonly employed for removing work damage produced during wafer 
fabrication. The boundary layer thickness ({link]) and therefore the dimensional control over 
the wafer is controlled by the rotation rate of the wafers. A common etch formulation is a 
4:1:3 mixture of HNO3 (79%), HF (49%), and MeCO>H (99%). There are some etchant 


formulations that are based on alternative (or additional) oxidizing agents, such as: Bro, Lb, 
and KMnO,. 


Alkaline etching (KOH/H20 or NaOH/H)0) is by nature anisotropic and the etch rate 
depends on the number of dangling bonds which in turn are dependent on the surface 
orientation. Since etching is reaction rate limited no rotation of the wafers is necessary and 
excellent uniformity over large wafers is obtained. Alkaline etchants are used with large 
wafers where dimensional uniformity is not maintained during lapping. A typical formulation 
uses KOH in a 45% weight solution in HzO at 90 °C. 


Etching gallium arsenide 


Although a wide range of etches have been investigated for GaAs, few are truly isotropic. 
This is because the surface activity of the (111) Ga and (111) As faces are very different. The 
As rich face is considerably more reactive than the Ga rich face, thus under identical 
conditions it will etch faster. As a result most etches give a polished surface on the As face, 
but the Ga face tends to appear cloudy or frosted due to the highlighting of surface features 
and crystallographic defects. 


As with silicon the etch systems involve oxidation and complexation. However, in the case of 
GaAs the gallium is already fully oxidized (formally Ga**), thus, it is the arsenic (formally 
the arsenide ion, As* that is oxidized by a suitable oxidizing agent (e.g., HO>) to the soluble 
oxide, As»O3, [link]. The gallium ions form the oxide Ga)O3 via the hydroxide, [link]. Both 
oxides are soluble in acid solutions, resulting in their removal from the surface. 

Equation: 


2 As* +6 H,O, + H* > As,O, +5 OH +4H,0 
Equation: 


2 Ga** + 6 OH > 2 Ga(OH); +3 H,O 


The peroxide based oxidative etches for GaAs are divided into acidic and basic etches. The 
composition and application of some of these systems are summarized in [link]. The most 
widely used of these is HySO,4/H»O>/H>O and is referred to as Caro's acid. The high viscosity 
of H2SO, results in diffusion-limited etching with high acid concentrations. Etches with low 
acid concentrations tend to be anisotropic. Phosphoric acid (H3PO,) or citric acid ((link]) 
may be exchanged for sulfuric acid (H»SO,). Replacement of the acid component with bases 
such as NH,OH or NaOH can result in near to truly isotropic etchants, although certain 
combinations can result in strong anisotropy. 


Formulation volume etch rate etch rate etch rate 
ratio (um/min) (m/min) (m/min) 
ae 5 3:1:50 0.8 0.8 0.8 
ae Lid 0.6 0.6 0.6 
ne . 1:700 0.3 0.3 0.3 
ee " 1:0.76 0.2 0.2 0.2 


The composition and application of selected etch systems for GaAs. 
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One of the earliest etching systems for GaAs is based on the use of a dilute (ca. 0.05 vol.%) 
solution of bromine (Br>) in ethanol. The Bry acts as the oxidant, resulting in the formation of 
soluble bromides. The etch rate of this system is different for different crystallographic 
planes, i.e., the etch rates for the (111) As, (100), and (111) Ga faces are in the ratio 6:5:1, 
although more uniform etch rates are observed with high Br concentrations (ca. 10 vol.%). 
These higher concentration solutions are used for the removal of damage due to cutting with 


the saw. 


Polishing 


The purpose of polishing is to produce a smooth, specular surface on which device features 
can be defined by lithography. In order to allow for very large scale integration (VLSI) or 
ultra large scale integration (ULSI) fabrication the wafer must have a surface with a high 
degree of flatness. Variations less than 5 to 10 ym across the wafer diameter are typical 
flatness specifications. In addition, given the preceding steps, wafer polishing must not leave 
residual contamination or surface damage. The techniques of wafer polishing are derived 
from the glass lens industry, with some important modifications that have been developed to 
meet the special requirements of the microelectronics industry. 


Differences between polishing and lapping 


If the surface of a wafer that has undergone lapping (or grinding) is examined with an 
electron microscope, cracks, ridges and valleys are observed. The top "relief layer" consists 
of peaks and valleys. Below this layer is a damaged layer characterized by microcracks, 
dislocations, slip and stress. [link] shows a schematic representation of the abraded surface. 
Both of these layers must be removed completely prior to further fabrication. Decreasing the 
particle size of the abrasive during lapping only decreases the scale of the damage, but does 
not eliminate it entirely. In fact this surface damage is a characteristic of the brittle fracture of 
single crystal Si and GaAs, and occurs because during lapping the abrasive grains are moved 
across the surface under a pressure beyond that of the fracture strength of the wafer materials 
(Si or GaAs). In contrast to the mechanical abrasion employed in lapping, polishing is a 
mechano-chemical process during which brittle fracture does not occur. A polished wafer 
does not display any evidence of a relief surface such as that produced by lapping, even at 
highest resolution electron microscope. 
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Schematic representation of a cross sectional view of an 
abraded wafer surface prior to polishing. 


Process of Polishing 


[link] shows a schematic of the polishing process. Polishing may be conducted on single 
wafers or as a batch process depending on the equipment employed. Single wafer polishing is 
preferred for larger wafers and allows for better surface flatness. In both processes, wafers are 
mounted onto a fixture, by either wax or a composite Felx-Mount™, and pressed against the 
polishing pad. The polishing pad is usually made from an artificial fabric such as polyester 
felt-polyurethane laminate. Polishing is accomplished by a mechano-chemical process in 
which aqueous polishing slurry is dripped onto the polishing pad, see [link]. The polishing 
slurry performs both a chemical and mechanical process, and consists of fine silica (SiO>) 
particles (100 A diameter) and an oxidizing agent. Aqueous sodium hydroxide (NaOH) is 
used for Si, while aqueous sodium chlorate (NaOCl) is preferred for GaAs. Suspending 
agents are usually added to prevent settling of the silica particles. Under the heat caused by 
the friction of the wafer on the polishing pad the wafer surface is oxidized, which is the 
chemical step, while in the mechanical step the silica particles in the slurry abrade the 
oxidized surface away. 
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Schematic representation of the wafer 
polishing process. 


In order to achieve a reasonable rate of removal of the relief and damaged layers and still 
obtain the highest quality surface, the polishing is done in two steps, stock removal and haze 
removal. The former is carried out with a higher concentration slurry and may proceed for 
about 30 minutes at a removal rate of 1 pm/min. Haze removal is performed with a very 
dilute slurry, a softer pad with a reaction time of about 5 to 10 minutes, during which the total 
amount of material removed is only about 1 jm. Due to the active chemical reaction between 
the wafer and the polishing agent, the wafers must be rinsed in deionized water immediately 
after polishing to prevent haze or stains from reforming. 


There are many variables that will influence the rate and quality of polishing. High pressure 
results in a higher polishing rate, but excessive pressure may cause non-uniform polishing, 
excessive heat generation and fast pad wear. The rate of polishing is increased with higher 
temperatures but this may also lead to haze formation. High wheel speeds accelerate the 


polishing rate but can raise the temperature and also results in problems in maintaining a 
uniform flow of slurry across the pad. Dense slurry concentrations increase the polishing rate 
but are more costly. The pH of the slurry solution can also affect the polishing rate, for 
example the polishing rate of Si gradually increases with increased pH (higher basicity) until 
a pH of about 12 where a dramatic decrease is observed. In general, the optimum polishing 
process for a given facility depends largely upon the interplay of product specification, 
yields, cost, and quality considerations and must be developed uniquely. The wafer polishing 
process does not improve the wafer flatness and, at best, polishing will not degrade the wafer 
flatness achieved in the lapping operation. 


Cleaning 


During the processes described above, semiconductor wafers are subjected to physical 
handling that leads to significant contamination. Possible sources of physical contamination 
include: 


a. airborne bacteria, 

b. grease and wax from cutting oils and physical handling, 

c. abrasive particulates (usually, silica, silicon carbide, alumina, or diamond dust) from 
lapping, grinding or sawing operations, 

d. plasticizers which are derived from containers and wrapping in which the wafers are 
handled and shipped. 


Chemical contamination may also occur as a result of improper cleaning after etch steps. 
Light-metal (especially sodium and potassium) species may be traced to impurities in etchant 
solutions and are chemisorbed on to the surface where they are particularly problematical for 
metal oxide semiconductor (MOS) based devices, although higher levels of such impurities 
are tolerable for bipolar devices. Heavy metal impurities (e.g., Cu, Au, Fe, and Ag) are 
usually caused by electrodeposition from etchant solutions during fabrication. While wafers 
are cleaned prior to shipping, contamination accumulated during shipping and storage 
necessitates that all wafers be subjected to scrupulous cleaning prior to fabrication. 
Furthermore, cleaning is required at each step during the fabrication process. Although wafer 
cleaning is a vital part of each fabrication step, it is convenient to discuss cleaning within the 
general topic of wafer fabrication. 


Cleaning silicon 


The first step in cleaning a Si wafer is removal of all physical contaminants. These 
contaminates are removed by rinsing the wafer in hot organic solvents such as 1,1,1- 
trichloroethane (Cl3CH3) or xylene (CgH,Me>), accompanied by mechanical scrubbing, 
ultrasonic agitation, or compressed gas jets. Removal of the majority of light metal 
contaminants is accomplished by rinsing in hot deionized water, however, complete removal 
requires a further more aggressive cleaning process. The most widely used cleaning method 


in the Si semiconductor industry is based on a two step, two solution sequence known as the 
“RCA Cleaning Method”. 


The first solution consists of HyO-H»O -NH,OH in a volume ratio of 5:1:1 to 7:2:1, which is 
used to remove organic contaminants and heavy metals. The oxidation of the remaining 
organic contaminants by the hydrogen peroxide (H2O>) produces water soluble products. 
Similarly, metal contaminants such as cadmium, cobalt, copper, mercury, nickel, and silver 
are solubilized by the NH,OH through the formation of soluble amino complexes, e.g., [link]. 
Equation: 


2 Cu**(s) + 6 NH,OH (aq) > [Cu(NH;)¢ (aq) 


The second solution consists of HyO-H 0 -HCI in a 6:1:1 to 8:2:1 volume ratio and removes 
the Group [(1), II(2) and III(13) metals. In addition, the second solution prevents re- 
deposition of the metal contaminants. Each of the washing steps is carried out for 10 - 20 
min. at 75 - 85 °C with rapid agitation. Finally, the wafers are blown dry under a stream of 
nitrogen gas. 


Cleaning GaAs 


In principle GaAs wafers may be cleaned in a similar manner to silicon wafers. The first step 
involves successive cleaning with hot organic solvents such as 1,1,1-trichloroethane, acetone, 
and methanol, each for 5-10 minutes. GaAs wafers cleaned in this manner may be stored 
under methanol for short periods of time. 


Most cleaning solutions for GaAs are actually etches. A typical solution is similar to the 
second RCA solution and consists of an 80:10:1 ratio of HyO-HO,-HCI. This solution is 
generally used at elevated temperatures (70 °C) with short dip times since it has a very fast 
etch rate (4.0 m/min). 


Measurements, inspections and packaging 


Quality control measurements of the semiconductor crystal and subsequent wafer are 
performed throughout the process as an essential part of the fabrication of wafers. From 
crystal and wafer shaping through the final wafer finishing steps, quality control 
measurements are used to ensure that the materials meets customer specifications, and that 
problems can be corrected before they create scrap material and thus avoid further processing 
of reject material. Quality control measurements can be broadly classified into mechanical, 
electrical, structural, and chemical. 


Mechanical measurements are concerned with the physical dimensions of the wafer, 
including: thickness, flatness, bow, taper and edge contour. Electrical measurements usually 
include: resistivity and lateral resistivity gradient, carrier type and lifetime. Measurements 


giving information on the perfection of the semiconductor crystal lattice are classified in the 
structural category and include: testing for stacking faults, and dislocations. Routine chemical 
measurements are limited to the measurement of dissolved oxygen and carbon by Fourier 
transform infrared spectroscopy (FT-IR). Finished wafers are individually marked for the 
purpose of identification and traceability. Packaging helps protect the finished wafers from 
contamination during shipping and storage. 


Industry standards defining in detail how quality control measurements are to be made and 
determining the acceptable ranges for measured values have been developed by the American 
Society of Testing Materials (ASTM) and the Semiconductor Equipment and Materials 
Institute (SEMI). 
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Doping 


Starting with a prepared, polished wafer then how do we get an integrated 
circuit? We will focus on the CMOS process, described in the last chapter. 
Let's assume we have wafer which was doped during growth so that it has a 
background concentration of acceptors in it (i.e. it is p-type). Referring back 
to CMOS Logic, you can see that the first thing we need to build is a n-tank 
or moat. In order to do this, we need some way in which to introduce 
additional impurities into the semiconductor. There are several ways to do 
this, but current technology relies almost exclusively on a technique called 
ion implantation. A diagram of an ion-implanter is shown in the figure in 
the previous section. An ion implanter uses a dopant source gas, ionizes it, 
and drives the ions into the wafer. The dopant gas is ionized and the 
resultant charged ions are accelerated through a magnetic field, where they 
are mass-analyzed. The vertical magnetic field causes the beam of ions to 
spread out, according to their mass. A thin aperture selects the ions of 
interest, and lets them pass, blocking all the others. This makes sure we are 
only implanting the ion we want, and in fact, even selects for the proper 
isotope! The ionized atoms are then accelerated through several tens to 
hundreds of kV, and then deflected by an electric field, much like in an 
oscilloscope CRT. In fact, most of the time the ion beam is "rastered" across 
the surface of the silicon wafer. The ions strike the silicon wafer and pass 
into its interior. A measurement of the current flow in the system and its 
integral, is a measure of how much dopant was deposited into the wafer. 
This is usually given in terms of the number of dopant stows. to which the 


wafer has been exposed. 


After the atoms enter the silicon, they interact with the lattice, creating 
defects, and slowing down until finally they stop. Typical atomic 
distributions, as a function of implant voltage are show in [link] for 
implantation into amorphous silicon. When implantation is done on single 
crystal material, channeling, the improved mobility of an ion down the 
"hallway" of a given lattice direction, can skew the impurity distribution 
significantly. Just slight changes of less than a degree can make big 
differences in how the impurity atoms are finally distributed in the wafer. 
Usually, the operator of the implant machine purposely tilts the wafer a few 


degrees off normal to the beam in order to arrive at more reproducible 
results. 
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As you might expect, shooting 100 kV ions at a silicon wafer probably does 
quite a bit of damage to the crystal structure. Not only that, but just having, 
say boron, in your wafer does not mean you are going to have holes. For the 
boron to become "electrically active" - that is to act as an acceptor - it has to 
reside on a silicon lattice site. Even if the boron atom does, somehow, end 
up on an actual lattice site when it stops crashing around in the wafer, the 
many defects which have been created will act as deep traps. Thus, the hole 
which is formed will probably be caught at a trap site and will not be able to 
contribute to electrical conductivity in the wafer anyway. How can we fix 
this situation? If we carefully heat up the wafer, we can cause the atoms in 
the crystal to shake around, and if we do it right, they all get back where 
they belong. Not only that, but the newly added impurities end up on lattice 
sites as well! This step is called annealing and it does just what it is 
supposed to. Typical temperatures and times for such an anneal are 500 to 
1000°C for 10 to 30 minutes. 


Something else occurs during the anneal step however. We have just added, 
by our implantation step, impurities with a fairly tight distribution as shown 
in [link]. There is an obvious gradient in impurity distribution, and if there 


is a gradient, than things may start moving around by diffusion, especially 
at elevated temperatures. 


Applications for Silica Thin Films 


Introduction 


While the physical properties of silica make it suitable for use in protective 
and optical coating applications, the biggest application of insulating SiO» 
thin films is undoubtedly in semiconductor devices, in which the insulator 
performs a number of specific tasks, including: surface passivation, field 
effect transistor (FET) gate layer, isolation layers, planarization and 
packaging. 


The term insulator generally refers to a material that exhibits low thermal or 
electrical conductivity; electrically insulating materials are also called 
dielectrics. It is in regard to the high resistance to the flow of an electric 
current that SiO» thin films are of the greatest commercial importance. The 
dielectric constant (€) is a measure of a dielectric materials ability to store 
charge, and is characterized by the electrostatic energy stored per unit 
volume across a unit potential gradient. The magnitude of ¢ is an indication 
of the degree of polarization or charge displacement within a material. The 
dielectric constant for air is 1, and for ionic solids is generally in the range 
of 5 - 10. Dielectric constants are defined as the ratio of the material’s 
capacitance to that of air, i.e., [link]. The dielectric constant for silicon 
dioxide ranges from 3.9 to 4.9, for thermally and plasma CVD grown films, 
respectively. 

Equation: 


& = Comerai/C 


material air 


An insulating layer is a film or deposited layer of dielectric material 
separating or covering conductive layers. Ideally, in these application an 
insulating material should have a surface resistivity of greater than 10!% 
Q/cm? or a volume resistivity of greater than 10!! Q.cm. However, for 
some applications, lower values are acceptable; an electrical insulator is 
generally accepted to have a resistivity greater than 10° Q.cm. CVD SiO, 
thin films have a resistivity of 10° - 10'° Q.cm, depending on the film 
growth method. 


As a consequence of its dielectric properties SiO», and related silicas, are 
used for isolating conducting layers, to facilitate the diffusion of dopants 
from doped oxides, as diffusion and ion implantation masks, capping doped 
films to prevent loss of dopant, for gettering impurities, for protection 
against moisture and oxidation, and for electronic passivation. Of the many 
methods used for the deposition of thin films, chemical vapor deposition 
(CVD) is most often used for semiconductor processing. In order to 
appreciate the unique problems associated with the CVD of insulating SiO» 
thin films it is worth first reviewing some of their applications. Summarized 
below are three areas of greatest importance to the fabrication of 
contemporary semiconductor devices: isolation and gate insulation, 
passivation, and planarization. 


Device isolation and gate insulation 


A microcircuit may be described as a collection of devices each consisting 
of "an assembly of active and passive components, interconnected within a 
monolithic block of semiconducting material". Each device is required to be 
isolated from adjacent devices in order to allow for maximum efficiency of 
the overall circuit. Furthermore within a device, contacts must also be 
electrically isolated. While there are a number of methods for isolating 
individual devices within a circuit (reverse-biased junctions, mesa isolation, 
use of semi-insulating substrates, and oxide isolation), the isolation of the 
active components in a single device is almost exclusively accomplished by 
the deposition of an insulator. 


In [link] is shown a schematic representation of a silicon MOSFET (metal- 
oxide-semiconductor field effect transistor). The MOSFET is the basic 
component of silicon-CMOS (complimentary metal-oxide-semiconductor) 
circuits which, in turn, form the basis for logic circuits, such as those used 
in the CPU (central processing unit) of a modern personal computer. It can 
be seen that the MOSFET is isolated from adjacent devices by a reverse- 
biased junction (p*-channel stop) and a thick oxide layer. The gate, source 
and drain contact are electrically isolated from each other by a thin 
insulating oxide. A similar scheme is used for the isolation of the collector 
from both the base and the emitter in bipolar transistor devices. 
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Schematic diagrams of a Si- MOSFET (metal-oxide-semiconductor 
field effect transistor). 


As a transistor, a MOSFET has many advantages over alternate designs. 
The key advantage is low power dissipation resulting from the high 
impedance of the device. This is a result of the thin insulation layer between 
the channel (region between source and drain) and the gate contact, see 
[link]. The presence of an insulating gate is characteristic of a general class 
of devices called MISFETs (metal-insulator-semiconductor field effect 
transistor). MOSFETs are a subset of MISFETs where the insulator is 
specifically an oxide, e.g., in the case of a silicon MISFET device the 
insulator is SiO», hence MOSFET. It is the fabrication of MOSFET circuits 
that has allowed silicon technology to dominate digital electronics (logic 
circuits). However, increases in computing power and speed require a 
constant reduction in device size and increased complexity in device 
architecture. 


Passivation 


Passivation is often defined as a process whereby a film is grown on the 
surface of a semiconductor to either (a) chemically protect it from the 
environment, or (b) provide electronic stabilization of the surface. 


From the earliest days of solid state electronics it has been recognized that 
the presence or absence of surface states plays a decisive role in the 
usefulness of any semiconducting material. On the surface of any solid state 
material there are sites in which the coordination environment of the atoms 
is incomplete. These sites, commonly termed "dangling bonds", are the 
cause of the electronically active states which allow for the recombination 
of holes and electrons. This recombination occurs at energies below the 
bulk value, and interferes with the inherent properties of the semiconductor. 
In order to optimize the properties of a semiconductor device it is desirable 
to covalently satisfy all these surface bonds, thereby shifting the surface 
states out of the band gap and into the valence or conduction bands. 
Electronic passivation may therefore be described as a process which 
reduces the density of available electronic states present at the surface of a 
semiconductor, thereby limiting hole and electron recombination 
possibilities. In the case of silicon both the native oxide and other oxides 
admirably fulfill these requirements. 


Chemical passivation requires a material that inhibits the diffusion of 
oxygen, water, or other species to the surface of the underlying 
semiconductor. In addition, the material is ideally hard and resistant to 
chemical attack. A perfect passivation material would satisfy both 
electronic and chemical passivation requirements. 


Planarization 


For the vast majority of electronic devices, the starting point is a substrate 
consisting of a flat single crystal wafer of semiconducting material. During 
processing, which includes the growth of both insulating and conducting 
films, the surface becomes increasingly non-planar. For example, a gate 
oxide in a typical MOSFET (see [link]) may be typically 100 - 250 A thick, 
while the isolation or field oxide may be 10,000 A. In order for the 
successful subsequent deposition of conducting layers (metallization) to 
occur without breaking metal lines (often due to the difficulty in 
maintaining step coverage), the surface must be flat and smooth. This 
process is called planarization, and can be carried out by a technique known 
as sacrificial etchback. The steps for this process are outlined in [link]. An 
abrupt step ([link]a) is coated with a conformal layer of a low melting 


dielectric, e.g., borophosphorosilicate glass, BPSG ({link |b), and 
subsequently a sacrificial organic resin ([link]c). The sample is then plasma 
etched such that the resin and dielectric are removed at the same rate. Since 
the plasma etch follows the contour of the organic resin, a smooth surface is 
left behind ({link]d). The planarization process thus reduces step height 
differentials significantly. In addition regions or valleys between individual 
metallization elements (vias) can be completely filled allowing for a route 
to producing uniformly flat surfaces, e.g., the BPSG film shown in [link]. 
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Schematic representation of the planarization process. 


A metallization feature (a) is CVD covered with 
silicate glass (b), and subsequently coated with an 
organic resin (c). After etching the resist a smooth 

silicate surface is produced (d). 


The processes of planarization is vital for the development of multilevel 
structures in VLSI circuits. To minimize interconnection resistance and 
conserve chip area, multilevel metallization schemes are being developed in 
which the interconnects run in 3-dimensions. 
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Oxidation of Silicon 


Note:This module was developed as part of the Rice University course 
CHEM-496: Chemistry of Electronic Materials. This module was prepared 
with the assistance of Andrea Keys. 


Introduction 


In the fabrication of integrated circuits (ICs), the oxidation of silicon is 
essential, and the production of superior ICs requires an understanding of 
the oxidation process and the ability to form oxides of high quality. Silicon 
dioxide has several uses: 


1. Serves as a mask against implant or diffusion of dopant into silicon. 
2. Provides surface passivation. 

3. Isolates one device from another (dielectric isolation). 

4, Acts as a component in MOS structures. 

5. Provides electrical isolation of multi-level metallization systems. 


Methods for forming oxide layers on silicon have been developed, 
including thermal oxidation, wet anodization, chemical vapor deposition 
(CVD), and plasma anodization or oxidation. Generally, CVD is used when 
putting the oxide layer on top of a metal surface, and thermal oxidation is 
used when a low-charge density level is required for the interface between 
the oxide and the silicon surface. 


Oxidation of silicon 


Silicon's surface has a high affinity for oxygen and thus an oxide layer 
rapidly forms upon exposure to the atmosphere. The chemical reactions 
which describe this formation are: 

Equation: 


Equation: 


In the first reaction a dry process is utilized involving oxygen gas as the 
oxygen source and the second reaction describes a wet process which uses 
steam. The dry process provides a "good" silicon dioxide but is slow and 
mostly used at the beginning of processing. The wet procedure is 
problematic in that the purity of the water used cannot be guaranteed to a 
suitable degree. This problem can be easily solved using a pyrogenic 
technique which combines hydrogen and oxygen gases to form water vapor 
of very high purity. Maintaining reagents of high quality is essential to the 
manufacturing of integrated circuits, and is a concern which plagues each 
step of this process. 


The formation of the oxide layer involves shared valence electrons between 
silicon and oxygen, which allows the silicon surface to rid itself of 
"dangling" bonds, such as lone pairs and vacant orbitals, [link]. These 
vacancies create mid-gap states between the valence and conduction bands, 
which prevents the desired band gap of the semiconductor. The Si-O bond 
strength is covalent (strong), and so can be used to achieve the loss of mid- 
gap states and passivate the surface of the silicon. 
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Removal of dangling bonds by oxidation of surface. 


The oxidation of silicon occurs at the silicon-oxide interface and consists of 
four steps: 


Diffusive transport of oxygen across the diffusion layer in the vapor phase 
adjacent to the silicon oxide-vapor interface. 

Incorporation of oxygen at the outer surface into the silicon oxide film. 
Diffusive transport across the silicon oxide film to its interface with the 
silicon lattice. 

Reaction of oxygen with silicon at this inner interface. 


As the Si-SiO> interface moves into the silicon its volume expands, and 
based upon the densities and molecular weights of Si and SiO», 0.44 A Si is 
used to obtain 1.0 A SiOp. 


Pre-oxidation cleaning 


The first step in oxidizing a surface of silicon is the removal of the native 
oxide which forms due to exposure to open air. This may seem redundant to 
remove an oxide only to put on another, but this is necessary since 
uncertainty exists as to the purity of the oxide which is present. The 
contamination of the native oxide by both organic and inorganic materials 
(arising from previous processing steps and handling) must be removed to 
prevent the degradation of the essential electrical characteristics of the 
device. A common procedure uses a HyO-H»O,-NH,OH mixture which 
removes the organics present, as well as some group I and II metals. 
Removal of heavy metals can be achieved using a H»O-H»O>-HCI mixture, 
which complexes with the ions which are formed. After removal of the 
native oxide, the desired oxide can be grown. This growth is useful because 
it provides: chemical protection, conditions suitable for lithography, and 
passivation. The protection prevents unwanted reactions from occurring and 
the passivation fills vacancies of bonds on the surface not present within the 
interior of the crystal. Thus the oxidation of the surface of silicon fulfills 
several functions in one step. 


Thermal oxidation 


The growth of oxides on a silicon surface can be a particularly tedious 
process, since the growth must be uniform and pure. The thickness wanted 
usually falls in the range 50 - 500 A, which can take a long time and must 
be done on a large scale. This is done by stacking the silicon wafers in a 
horizontal quartz tube while the oxygen source flows over the wafers, 
which are situated vertically in a slotted paddle (boat), see [link]. This 
procedure is performed at 1 atm pressure, and the temperature ranges from 
700 to 1200 °C, being held to within +1 °C to ensure uniformity. The choice 
of oxidation technique depends on the thickness and oxide properties 
required. Oxides that are relatively thin and those that require low charge at 
the interface are typically grown in dry oxygen. When thick oxides are 
required (> 0.5 mm) are desired, steam is the source of choice. Steam can 
be used at wide range of pressures (1 atm to 25 atm), and the higher 
pressures allow thick oxide growth to be achieved at moderate temperatures 
in reasonable amounts of time. 
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Horizontal diffusion tube showing the 
oxidation of silicon wafers at 1 atm 
pressure. 


The thickness of SiO> layers on a Si substrate is readily determined by the 
color of the film. [link] provides a guidline for thermal grown oxides. 
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Color chart for thermally grown SiO, films observed under daylight 
fluorescent lighting. 


High pressure oxidation 


High pressure oxidation is another method of oxidizing the silicon surface 
which controls the rate of oxidation. This is possible because the rate is 
proportional to the concentration of the oxide, which in turn is proportional 
to the partial pressure of the oxidizing species, according to Henry's law, 
[link], where C is the equilibrium concentration of the oxide, H is Henry's 
law constant, and pg is the partial pressure of the oxidizing species. 
Equation: 


C = Hg) 


This approach is fast, with a rate of oxidation ranging from 100 to 1000 
mm/h, and also occurs at a relatively low temperature. It is a useful process, 
preventing dopants from being displaced and also forms a low number of 
defects, which is most useful at the end of processing. 


Plasma oxidation 


Plasma oxidation and anodization of silicon is readily accomplished by the 
use of activated oxygen as the oxidizing species. The highly reactive 
oxygen is formed within an electrical discharge or plasma. The oxidation is 
carried out in a low pressure (0.05 - 0.5 Torr) chamber, and the the plasma 
is produced either by a DC electron source or a high-frequency discharge. 
In simple plasma oxidation the sample (i.e., the silicon wafer) is held at 


ground potential. In contrast, aniodization systems usually have a DC bias 
between the sample and an electrode with the sample biased positively with 
respect to the cathode. Platinum electrodes are commonly used as the 
cathodes. 


There have been at least 34 different reactions reported to occur in an 
oxygen plasma, however, the vast majority of these are inconsequential 
with respect to the formation of active species. Furthermore, many of the 
potentially active species are sufficiently short lived that it is unlikely that 
they make a significant contribution. The primary active species within the 
oxygen plasma are undoubtedly O" and O7*. Both being produced in near 
equal quantities, although only the former is relevant to plasma 
aniodization. While these species may be active with respect to surface 
oxidation, it is more likely that an electron transfer occurs from the 
semiconductor surface yields activated oxygen species, which are the actual 
reactants in the oxidation of the silicon. 


The significant advatage of plasma processes is that while the electron 
temperature of the ionized oxygen gas is in excess of 10,000 K, the thermal 
temperatures required are significantly lower than required for the high 
pressure method, i.e., < 600 °C. The advantages of the lower reaction 
temperatures include: the minimization of dopant diffusion and the 
impediment of the generation of defects. Despite these advantages there are 
two primary disadvantages of any plasma based process. First, the high 
electric fields present during the processes cause damage to the resultant 
oxide, in particular, a high density of interface traps often result. However, 
post annealing may improve film quality. Second, the growth rates of 
plasma oxidation are low, typically 1000 A/h. This growth rate is increased 
by about a factor of 10 for plasma aniodization, and further improvements 
are observed if 1 - 3% chlorine is added to the oxygen source. 


Masking 


A selective mask against the diffusion of dopant atoms at high temperatures 
can be found in a silicon dioxide layer, which can prove to be very useful in 
integrated circuit processing. A predeposition of dopant by ion 


implantation, chemical diffusion, or spin-on techniques typically results in a 
dopant source at or near the surface of the oxide. During the initial high- 
temperature step, diffusion in the oxide must be slow enough with respect 
to diffusion in the silicon that the dopants do not diffuse through the oxide 
in the masked region and reach the silicon surface. The required thickness 
may be determined by experimentally measuring, at a particular 
temperature and time, the oxide thickness necessary to prevent the inversion 
of a lightly doped silicon substrate of opposite conductivity. To this is then 
added a safety factor, with typical total values ranging from 0.5 to 0.7 mm. 
The impurity masking properties result when the oxide is partially 
converted into a silica impurity oxide "glass" phase, and prevents the 
impurities from reaching the SiO,-Si interface. 
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Photolithography 


Note:This module is based upon the Connexions module entitled 
Photolithography by Bill Wilson. 


Actually, implants (especially for moats) are usually done at a sufficiently 
high energy so that the dopant (phosphorus) is already pretty far into the 
substrate (often several microns or so), even before the diffusion starts. The 
anneal/diffusion moves the impurities into the wafer a bit more, and as we 
Shall see also makes the n-region grow larger. 


"The n-region"! We have not said a thing about how we make our moat in 
only certain areas of the wafer. From the description we have so far, is 
seems we have simply built an n-type layer over the whole surface of the 
wafer. This would be bad! We need to come up with some kind of 
"window" to only permit the implanting impurities to enter the silicon wafer 
where we want them and not elsewhere. We will do this by constructing an 
implantation "barrier". 


To do this, the first thing we do is grow a layer of silicon dioxide over the 
entire surface of the wafer. We talked about oxide growth when we were 
discussing MOSFETs but let's go into a little more detail. You can grow 
oxide in either a dry oxygen atmosphere, or in a an atmosphere which 
contains water vapor, or steam. In [link], we show oxide thickness as a 
function of time for growth with steam. Dry Oz does not behave too much 
differently, the rate is just somewhat slower. 


Xox oxide thickness (um) 


time (minutes) 


A plot of oxide thickness as a 
function of time. 


On top of the oxide, we are now going to deposit yet another material. This 
is silicon nitride, Si3N, or just plain "nitride" as it is usually called. Silicon 
nitride is deposited through a method called chemical vapor deposition or 
"CVD". The usual technique is to react dichlorosilane and ammonia in a hot 
walled low pressure chemical vapor deposition system (LPCVD). The 
reaction is: 

Equation: 


3 SiH,Cl, + 1ONH, > Si;N, + 6NH,Cl + 6H, 


Silicon nitride is a good barrier for impurities, oxygen and other things 
which do not want to get into the wafer. Take a look at [link] and see what 
we have so far. A word about scale and dimensions. The silicon wafer is 
about 250 pm thick (about 0.01") since it has to be strong enough not to 
break as it is being handled. The two deposited layers are each about 1 pm 
thick, so they should actually be drawn as lines thinner than the other lines 
in the figure. This would obviously make the whole idea of a sketch 
ridiculous, so we will leave things distorted as they are, keeping in mind 
that the deposited and diffused layers are actually much thinner than the rest 


of wafer, which really does not do anything except support the active 
Circuits up on top. 
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silicon 


Initial wafer 
configuration. 


Now what we want to do is remove part of the nitride, so we can make our 
n-well, but not put in phosphorous where do not want it. We do this with a 
processes called photolithography and etching respectively. First thing we 
do is coat the wafer with yet another layer of material. This is a liquid 
called photoresist and it is applied through a process called spin-coating. 
The wafer is put on a vacuum chuck, and a layer of liquid photoresist is 
sprayed uncap of the wafer. The chuck is then spun rapidly, getting to 
several thousand RPM in a small fraction of a second. Centrifugal force 
causes the resist to spread out uniformly across the wafer surface. The 
solvent for the photoresist is quite volatile and so the layer of photoresist 
dries while the wafer is still spinning, resulting in a thin, uniform coating 
across the wafer [link]. 
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silicon 


After the photoresist is 
spun on. 


The name "photoresist" gives some clue as to what this stuff is. Basically, 
photoresist is a polymer mixed with some kind of light sensitizing 
compound. In positive photoresist, wherever light strikes it, the polymer is 
weakened, and it can be more easily removed with a solvent during the 
development process. Conversely, negative photoresist is strengthened 
when it is illuminated with light, and is more resistant to the solvent than is 
the unilluminated photoresist. Positive resist is so-called because the image 
of the developed photoresist on the wafer looks just like the mask that was 
used to create it. Negative photoresist makes an image which is the opposite 
of what the mask looks like. 


We have to come up with some way of selectively illuminating certain 
portions of the photoresist. Anyone who has ever seen a projector know 
how we can do this. But, since we want to make small things, not big ones, 
we will change around our projector so that it makes a smaller image, 
instead of a bigger one. The instrument that projects the light onto the 
photoresist on the wafer is called a projection printer or stepper [link]. 
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A schematic of a stepper 
configuration. 


As shown in [link], the stepper consists of several parts. There is a light 
source (usually a mercury vapor lamp, although ultra-violet excimer lasers 
are also starting to come into use), a condenser lens to image the light 
source on the mask or reticle. The mask contains an image of the pattern we 
are trying the place on the wafer. The projection lens then makes a reduced 
(usually 5x) image of the mask on the wafer. Because it would be far too 
costly, if not just plain impossible, to project onto the whole wafer all at 
once, only a small selected area is printed at one time. Then the wafer is 
scanned or stepped into a new position, and the image is printed again. If 
previous patterns have already been formed on the wafer, TV cameras, with 
artificial intelligence algorithms are used to align the current image with the 
previously formed features. The stepper moves the whole surface of the 
wafer under the lens, until the wafer is completely covered with the desired 
pattern. A stepper is one of the most important pieces of equipment in the 
whole IC fab however, since it determines what the minimum feature size 
on the circuit will be. 


After exposure, the photoresist is placed in a suitable solvent, and 
"developed". Suppose for our example the structure shown in [link] is what 
results from the photolithographic step. 
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After photoresist 
exposure and 
development. 


The pattern that was used in the photolithographic (PL) step exposed half of 
our area to light, and so the photoresist (PR) in that region was removed 
upon development. The wafer is now immersed in a hydrofluoric acid (HF) 


solution. HF acid etches silicon nitride quite rapidly, but does not etch 
silicon dioxide nearly as fast, so after the etch we have what we see in 


After the nitride etch step. 


We now take our wafer, put it in the ion implanter and subject it to a "blast" 
of phosphorus ions [link]. 
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Implanting phospohrus. 


The ions go right through the oxide layer on the RHS, but stick in the 
resist/nitride layer on the LHS of our structure. 


Optical Issues in Photolithography 


Note:This module was developed as part of the Rice University course 
CHEM-496: Chemistry of Electronic Materials. This module was prepared 
with the assistance of Zane Ball. 


Introduction 


Photolithography is one of the most important technology in the production 
of advanced integrated circuits. It is through photolithography that 
semiconductor surfaces are patterned and the circuits formed. In order to 
make extremely small features, on the order of the wavelength of the light, 
advanced optical techniques are used to transfer a pattern from a mask onto 
the surface. A polymeric film or resist, is modified by the light and records 
the information in a process not dissimilar to ordinary photography. 


An illustration of the photolithographic process is shown in [link]. The 
process follows the following basic steps: 


The wafer is spin coated with resist to form a uniform ~1 pm thin film of 
resist on the surface. 

The wafer is exposed with ultraviolet light through a mask which contains 
the desired pattern. In the simplest processes the mask is simply placed over 
the wafer, but advanced sub-micron technologies require the pattern to 
imaged through a complex optical system. 

The photoresist is developed and the irradiated area is washed away 
(positive resist) or the unirradiated area is washed away (negative resist). 
Processing (etching, deposition etc.) 

Remaining resist is stripped. 
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Steps in optical printing using photolithography. 


In addition to being possibly the most important semiconductor process 
step, photolithography is also the most expensive technology in 
semiconductor manufacturing. This expense is the result of two 
considerations: 


1. The optics in photolithography tools are expensive where a single lens 
can cost a $1 million or more 

2. Each chip (often referred to as a "dye") must be exposed individually 
unlike other semiconductor processes such as CVD where an entire 
wafer can be processed at a time or oxidation processes where many 
wafers can be processed simultaneously. 


This means that not only are photolithography machines the most expensive 


of semiconductor processing equipment, but more of them are needed in 
order to maintain throughput. 


Optical issues in photolithography 


The critical dimension and depth of focus 


A semiconductor process technology is often described by a characteristic 
length known as the critical dimension. The critical dimension (CD) is the 
smallest feature that needs to be patterned on the surface. The exact 
definition varies from process to process but is often the channel length of 
the smallest transistor (typical of a memory chip) or the width of the 
smallest metal interconnection line (logic chips). This critical dimension is 
defined by the photolithographic process and is perhaps the most important 
figure of merit in the manufacture of integrated circuits. Making the critical 
dimension smaller is the primary focus of improving semiconductor 
technology for the following reasons: 


1. Making the CD smaller dramatically increases the number of devices 
per unit area and this increase goes with the square of the CD (i.e., a 
reduction in CD by a factor of 2 generates 4 times the number of 
devices). 

2. Making the CD smaller of a device already in production will make a 
smaller chip. This means that the number of chips per wafer increases 
dramatically, and since costs generally scale with the number of wafers 
and not the number of chips to a wafer, costs are dramatically reduced. 

3. Smaller devices are faster. 


Therefore, improvements in lithography technology translate directly into 
better, faster, more complex circuits at lower cost. 


Having established the importance of the critical dimension it is important 
to understand what features of a photolithography system impact. The 
theory behind projection lithography is very well known, dating from the 
original analysis of the microscope by Abbe. It is, in fact, the Abbe sine 
condition that dictates the critical dimension: 

Equation: 
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Xr 
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where the two expressions refer to the limit of a purely coherent 
illuminating source and purely incoherent source respectively, and A is the 
vacuum wavelength of the illuminating light source, n the index of 
refraction of the objective lens, and © refers to the angle between the axis 
of the lens and the line from the back focal point to the aperture of the 
entrance of the lens. The quantity in the denominator, nsin(@) is referred to 
as the numerical aperture or NA. As the degree of coherence can be 
adjusted in a lithography system, the critical dimension is usually written 
more generally as: 

Equation: 
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From this equation, we begin to see what can be done to reduce the critical 
dimension of a lithography system: 


1. Change the wavelength of the source. 
2. Increase the numerical aperture (NA). 
3. Reduce k;. 


Before we discuss how this is accomplished, we must consider one other 
key quantity, the depth of focus or DOF. The depth of focus is the length 
along the axis in which a sharp image exists. Naturally a large DOF is 
desirable for ease of alignment, since the entire dye must with lie within 
this region. In reality, however, the more meaningful constraint is that the 
DOF must be thicker than the resist layer so that the entire volume of resist 
is exposed and can be developed. Also, if the surface morphology of the 
device dictates that the resist to be exposed is not planar, then the DOF 


must be large enough so that all features are properly illuminated. Current 
resists must be 1 pm in thickness in order to have the necessary etch 
resistance, so this can be considered a minimum value for an acceptable 
DOF. The depth of focus can also be expressed as a function of numerical 
aperture and wavelength: 

Equation: 
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If we desire to minimize the critical dimension simply by making optics of 
large numerical aperture that we will simultaneously reduce the depth of 
focus and at a much faster rate owing to the dependence on the square of 
the numerical aperture. 


These two quantities, DOF and CD, provide the direction in lithography and 
semiconductor processing as a whole. For example, a design with an 
improved surface planarity or a new resist that is effective at smaller 
thicknesses would allow for a smaller depth of focus which would in turn 
allow for a larger numerical aperture implying a smaller critical dimension. 
The resist, the source wavelength, and the optical delivery system all affect 
the critical dimension and that further refinements require a multifaceted 
approach to improving lithography systems. What also must be realized is 
that, as far as the optical system is concemed, virtually all that can be done 
with conventional optics has been done and that fundamental restraints on 
k, have been reached. 


Wavefront engineering 


One way to get around the fundamental limitations of an imaging system 
illustrated in [link] is through one of a variety of techniques often termed 
wavefront engineering. Here, not only is the amplitude mapped from the 
object plane to the image plane, but the phase structure of the light going 
through the mask is manipulated to improve the contrast and allow for 
effective values of k,; lower than the theoretical minimum for uniform 


illumination. The most important example of these techniques is the phase 
shift mask or PSM. Here the mask consists of two types of areas, those that 
allow light to pass through unaffected and some regions where the 
amplitude of the light is unaffected but its phase is shifted. The resulting 
electric fields will then sum to zero in some places where use of an ordinary 
mask would have resulted in a positive intensity. 


There are many problems with the practical introduction of various phase 
shifting techniques. Construction of masks with phase shifting elements 
(usually a thin layer of PMMA) is difficult and expensive. Mask damage, 
already a key problem in conventional production techniques, becomes an 
even greater issue as traditional mask repair techniques can no longer be 
used. Also identifying errors in a mask is made more difficult by the odd 
design. 


Interaction with resists 


The ultimate resolution of a photolithographic process is not dependent on 
optics alone, but also on the interaction with the resist. One of the key 
concerns, particularly as wavelengths of sources become shorter, is the 
ability of the source light to penetrate the resist film. Many polymers absorb 
strongly in the UV which can limit the interaction to the surface. In such a 
case only a thin layer of the polymer is exposed and the pattern may not be 
fully uncovered during developing. One important property of resist is the 
presence of saturable absorption.. Saturable absorbers are those absorption 
sites in the polymer that when excited to a higher state remain there for 
relatively long periods of time and do not continue to absorb into higher 
states. If only saturable absorption is present in a polymer film, then 
continued irradiation eventually leads to transparency as all absorption sites 
will be saturated. This allows light penetration through the resist film with 
full exposure to the substrate surface. 


Full penetration of the film leads to a second problem, multiple reflection 
interference. This occurs when light which has penetrated the film to the 
substrate is then reflected back towards the surface. The result is a standing 
wave interference pattern which causes uneven exposure through the film. 


The problem becomes more severe as optical limits are approached where 
feature size is approximately equal to the wavelength of the light source 
meaning such standing waves are the same size as the irradiated features. In 
the most advanced lithography techniques such as 248 nm lithography with 
excimer lasers, a special anti-reflectance coating must be laid down before 
the resist is deposited. Development of an AR coating that has no adverse 
effects during the exposure and development process is difficult. 


One completely new approach to photolithography resists are top-surface- 
imaged resists or TSI resists. These processes do not require light 
penetration through the whole volume of resist. In a TSI resist, a silyl amine 
is selectively in-diffused from the gas phase into a phenolic polymer in 
response to the laser irradiation. This diffusion process creates a silyl ether, 
and development takes place in the form of an oxygen plasma etch, 
sometimes termed 'dry developing’. Depth of focus limitations are thus 
avoided as exposure is necessary only at the surface of the resist layer, and 
the resolution of the etching process determines the final resist profile. Such 
a technique has tremendous advantages, particularly as source wavelengths 
become shorter and transparent polymers more rare. Such as resist has a 
clear optical advantage as well since the image need only be formed at the 
surface of the resist layer reducing the DOF needed to 100 nm or less, 
allowing for larger numerical aperture lithography systems with smaller 
critical dimensions. 


Light sources 


Current photolithography techniques in production utilize ultraviolet lamps 
as the light source. In the most advanced production facilities, 0.35 pm 
mercury i-line technology is used. For the next generation of chips such as 
64 Mbit DRAMS better performance is necessary and either i-line 
technology combined with PSM or a new light source is required. Certainly 
for the 256 Mbit generation using 0.25 jam technology, the i-line source is 
no longer adequate. The apparent successor is the 248 nm KrF laser, which 
entered the most advanced production facilities in the late 1990s. KrF 
technology is often referred to in the literature as Deep UV or DUV 
lithography. For further shrinkage to 0.18 am technology, the ArF excimer 


laser at 193 nm will likely be used with the transition likely to take place in 
the first few years of the next decade. 


At critical dimensions lower than 0.18 - 0.1 pm and below, a whole host of 
technological problems will need to be overcome in every stage of 
manufacturing including photolithography. One likely scheme for future 
lithography is to use X-rays where the wavelength of the light is so much 
smaller than the feature size such that proximity printing can be used. This 
is where the mask is placed close to the surface and an X-ray source is 
scanned across using no optics. Common X-ray sources for such techniques 
include synchrotron radiation and laser produced plasmas. It has also been 
widely suggested that the cost of implementing X-ray or other post-optical 
techniques together with the increased cost of every other manufacturing 
process step will make improvements beyond 0.1 ym cost prohibitive 
where benefits in increased circuit speed and density will be dwarfed by 
massive manufacturing cost. It is noted however that such predictions have 
been made in the past with regard to other technological barriers. 
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Composition and Photochemical Mechanisms of Photoresists 


Note:This module was developed as part of the Rice University course 
CHEM-496: Chemistry of Electronic Materials. This module was prepared 
with the assistance of Angela Cindy Wei. 


Photolithography 


In photolithography, a pattern may be transferred onto a photoresist film by 
exposing the photoresist to light through a mask of the pattern. In the 
semiconductor industry, the photolithographic procedure includes the 
following steps as illustrated in [link]: coating a base material with 
photoresist, exposing the resist through a mask to light, developing the 
resist, etching the exposed areas of the base, and stripping the remaining 
resist off. 
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Steps in optical printing using photolithography. 


Upon exposure to light, the photoresist may become more or less soluble 
depending on the chemical properties of the particular resist material. The 
photochemical reactions include chain scission, cross-linking, and the 
rearrangement of molecules. If the exposed areas of the photoresist become 
more soluble, then it is a positive resist; conversely, if the exposed resist 
becomes less soluble, then it is a negative resist. In developing the 
photoresist, the more soluble material is removed leaving a positive or a 
negative image of the mask pattern. 


Photoresist 


Photoresists were initially developed for the printing industry. In the 1920s, 
the application of photoresists spread to the printed circuit board industry. 
Photoresists for semiconductor use were first developed in the 1950s; 
Kodak developed commercial negative photoresists and shortly after, 
Shipley developed a line of positive resists. Several other companies have 
entered the market since that time in hopes of manufacturing resist products 
which meet the increasing demands of the semiconductor industry: 
narrower line widths, fewer defects, and higher production rates. 


Photoresist composition 


Several functional requirements must be met for a photoresist to be used in 
the semiconductor industry. Photoresist polymers must be soluble for easy 
deposition onto a substrate by spin-coating. Good photoresist-substrate 
adhesion properties are required to minimize undercutting, to maintain edge 
acuity, and to control the feature sizes. The photoresist must be chemically 
resistant to whichever etchants are to be used. Sensitivity of the photoresist 
to a particular light source is essential to the functionality of a photoresist. 
The speed at which chemical changes occur in a photoresist is its contrast. 
The contrast of a resist is dependent on the molecular weight distribution of 


the polymers: a broad molecular weight distribution results in a low contrast 
resist. High contrast resists produce higher resolution images. 


The four basic components of a photoresist are the polymer, the solvent, 
sensitizers, and other additives. The role of the polymer is to either 
polymerize or photosolubilize when exposed to light. Solvents allow the 
photoresist to be applied by spin-coating. The sensitizers control the 
photochemical reactions and additives may be used to facilitate processing 
or to enhance material properties. Photochemical changes to polymers are 
essential to the functionality of a photoresist. Polymers are composed 
primarily of carbon, hydrogen, and oxygen-based molecules arranged in a 
repeated pattern. Negative photoresists are based on polyisopreme 
polymers; negative resist polymers are not chemically bonded to each other, 
but upon exposure to light, the polymers crosslink, or polymerize. Positive 
photoresists are formulated from phenol-formaldehyde novolak resins; the 
positive resist polymers are relatively insoluble, but upon exposure to light, 
the polymers undergo photosolubilization. 


Solvents are required to make the photoresist a liquid, which allows the 
resist to be spun onto a substrate. The solvents used in negative photoresists 
are non-polar organic solvents such as toluene, xylene, and halogenated 
aliphatic hydrocarbons. In positive resists, a variety of organic solvents such 
as ethyl cellosolve acetate, ethoxyethyl acetate, diglyme, or cyclohexanone 
may be used. 


Photosensitizers are used to control or cause polymer reactions resulting in 
the photosolubilization or crosslinking of the polymer. The sensitizers may 
also be used to broaden or narrow the wavelength response of the 
photoresist. Bisazide sensitizers are used in negative photoresists while 
positive photoresists utilize diazonaphthoquinones. One measure of 
photosensitizers is their quantum efficiencies, the fraction of photons which 
result in photochemical reactions; the quantum efficiency of positive 
diazonaphthoquinone photoresist sensitizers has been measured to be 0.2 - 
0.3 and the quantum efficiency of negative bis-arylazide sensitizers is in the 
range of 0.5 - 1.0. 


Additives are also introduced into photoresists depending on the specific 
needs of the application. Additives may be used to increase photon 


absorption or to control light within the resist film. Adhesion promoters 
such as hexamethyldisilazane and additives to improve substrate coating are 
also commonly used. 


Negative photoresist chemistry 


The matrix resin material used in the formulation of these (negative) resists 
is a synthetic rubber obtained by a Ziegler-Natta polymerization of isoprene 
which results in the formation of poly(cis-isoprene). Acid-catalyzation of 
poly(cis-isoprene) produces a partially cyclized polymer material; the 
cyclized polymer has a higher glass transition temperature, better structural 
properties, and higher density. On the average, microelectronic resist 
polyisoprenes contain 1-3 rings per cyclic unit, with 5-20% unreacted 
isoprene units remaining’. The resultant material is extremely soluble in 
non-polar, organic solvents including toluene, xylene, and halogenated 
aliphatic hydrocarbons. 


The condensation of para-azidobenzaldehyde with a substituted 
cyclohexanone produces bis-arylazide sensitizers. To maximize the 
absorption of a particular light source, the absorbance spectrum of the 
photoresist may be shifted by making structural modifications to the 
sensitizers; for example, by using substituted benzaldehydes, the absorption 
peak may be shifted to longer wavelengths. A typical bisazide-cyclized 
polyisoprene photoresist formulation may contain 97 parts cyclized 
polyisoprene to 3 parts bisazide in a (10 wt%) xylene solvent. 


All negative photoresists function by cross-linking a chemically reactive 
polymer via a photosensitive agent that initiates the chemical cross-linking 
reaction. In the bisazide-cyclized polyisoprene resists, the absorption of 
photons by the photosensitive bisazide in the photoresist results in an 
insoluble crosslinked polymer. Upon exposure to light, the bisazide 
sensitizers decompose into nitrogen and highly reactive chemical 
intermediates, called nitrenes [link]. The nitrines react to produce polymer 
linkages and three-dimensional cross-linked structures that are less soluble 
in the developer solution. 

Equation: 
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Positive photoresist chemistry 


Positive photoresist materials originally developed for the printing industry 
have found use in the semiconductor industry. The commonly used novolac 
resins (phenol-formaldehyde copolymer) and (photosensitive) diazoquinone 
both were products of the printing industry. 


The novolak resin is a copolymer of a phenol and formaldehyde ([link]). 
Novolak resins are soluble in common organic solvents (including ethyl 
cellosolve acetate and diglyme) and aqueous base solutions. Commercial 
resists usually contain meta-cresol resins formed by the acid-catalyzed 
condensation of meta-cresol and formaldehyde. 
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Structure of a novolak resin. 


The positive photoresist sensitizers are substituted diazonaphthoquinones. 
The choice of substituents affects the solubility and the absorption 


characteristics of the sensitizers. Common substituents are aryl sulfonates. 
The diazoquinones are formed by a reaction of diazonaphthoquinone 
sulfonyl chloride with an alcohol to form sulfonate ester; the sensitizers are 
then incorporated into the resist via a carrier or bonded to the resin. The 
sensitizer acts as a dissolution inhibitor for the novalac resin and is base- 
insoluble. The positive photoresist is formulated from a novolac resin, a 
diazonaphthoquinone sensitizer, and additives dissolved in a 20 - 40 wt% 
organic solvent. In a typical resist, up to 40 wt% of the resist may be the 
sensitizer. 


The photochemical reaction of quinonediazide is illustrated in [link]. Upon 
absorption of a photon, the quinonediazide decomposes through Wolff 
rearrangement, specifically a Sus reaction, and produces gaseous nitrogen 
as a by-product. In the presence of water, the decomposition product forms 
an indene carboxylic acid, which is base-soluble. However, the formation of 
acid may not be the reason for increased solubility; the release of nitrogen 
gas produces a porous structure through which the developer may readily 
diffuse, resulting in increased solubility. 


Equation: 
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Image reversal 


By introducing an additive to the novolac resins with diazonaphthaquiones 
sensitizers, the resultant photoresist may be used to form a negative image. 
A small amount of a basic additive such as monazoline, imidazole, and 
triethylamine is mixed into a positive novolac resist. Upon exposure to 


light, the diazonaphthaquiones sensitizer forms an indene carboxylic acid. 
During the subsequent baking process, the base catalyzes a thermal 
decarboxylation, resulting in a substituted indene that is insoluble in 
aqueous base. Then, the resist is flood exposed destroying the dissolution 
inhibitors remaining in the previously unexposed regions of the resist. The 
development of the photoresist in aqueous base results in a negative image 
of the mask. 


Comparison of positive and negative photoresists 


Into the 1970s, negative photoresist processes dominated. The poor 
adhesion and the high cost of positive photoresists prevented its widespread 
use at the time. As device dimensions grew smaller, the advantages of 
positive photoresists, better resolution and pinhole protection, suited the 
changing demands of the semiconductor industry and in the 1980s the 
positive photoresists came into prominence. A comparison of negative and 
positive photoresists is given in [link]. 
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A comparison of negative and positive photoresists. 


The better resolution of positive resists over negative resists may be 
attributed to the swelling and image distortion of negative resists during 
development; this prevents the formation of sharp vertical walls of negative 
resist. Disadvantages of positive photoresists include a higher cost and 
lower sensitivity. 


Positive photoresists have become the industry choice over negative 
photoresists. Negative photoresists have much poorer resolution and the 
positive photoresists exhibit better etch resistance and better thermal 
stability. As optical masking processes are still preferred in the 
semiconductor industry, efforts to improve the processes are ongoing. 
Currently, researchers are studying various forms of chemical amplification 
to increase the photon absorption of photoresists. 
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Integrated Circuit Well and Gate Creation 


Note: This module is based upon the Connexions module entitled 
Integrated Circuit Well and Gate Creation by Bill Wilson. 


We then remove the remaining resist, and perform an 
activation/anneal/diffusion step, also sometimes called the "drive-in". The 
purpose of this step is two fold. We want to make the n-tank deep enough so 
that we can use it for our p-channel MOS, and we want to build up an 
implant barrier so that we can implant into the p-substrate region only. We 
introduce oxygen into the reactor during the activation, so that we grow a 
thicker oxide over the region where we implanted the phosphorus. The 
nitride layer over the p-substrate on the LHS protects that area from any 
oxide growth. We then end up with the structure shown in [link]. 


nitride 


After the anneal/drive-in. 


Now we strip the remaining nitride. Since the only way we can convert 
from p to n is to add a donor concentration which is greater than the 
background acceptor concentration, we had to keep the doping in the 
substrate fairly light in order to be able to make the n-tank. The lightly 
doped p-substrate would have too low a threshold voltage for good n-MOS 
transistor operation, so we will do a Vr adjust implant through the thin 
oxide on the LHS, with the thick oxide on the RHS blocking the boron from 
getting into the n-tank. This is shown in [link], where boron is implanted 


into the p-type substrate on the LHS, but is blocked by the thick oxide in 


the region over the n-well. 
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Vy adjust implant. 


Next, we strip off all the oxide, grow a new thin layer of oxide, and then a 
layer of nitride [link]. The oxide layer is grown only because it is bad to 
grow Si3N, directly on top of silicon, as the different coefficients of thermal 
expansion between the two materials causes damage to the silicon crystal 
structure. Also, it turns out to be nearly impossible to remove nitride if it is 
deposited directly on to silicon. 
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Strip of the oxide and 
grow a new nitride layer. 


The nitride is patterned (covered with photoresist, exposed, developed, 
etched, and removal of photoresist) to make two areas which are called 
"active" [link]. The wafer is then subjected to a high-pressure oxidation step 
which grows a thick oxide wherever the nitride was removed. The nitride is 
a good barrier for oxygen, so essentially no oxide grows underneath it. The 


thick oxide is used to isolate individual transistors, and also to make for an 
insulating layer over which conducting patterns can be run. The thick oxide 
is called field oxide (or FOX for short) [link]. 
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Nitride remaining after 
etching. 
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After growth of the field 
oxide (FOX). 


Then, the nitride, and some of the oxide are etched off. The oxide is etched 
enough so that all of the oxide under the nitride regions is removed, which 
will take a little off the field oxide as well. This is because we now want to 
grow the gate oxide, which must be very clean and pure [link]. The oxide 
under the nitride is sometimes called a sacrificial oxide, because it is 
sacrificed in the name of ultra performance. 


Ready to grow gate 
oxide. 


Then the gate oxide is grown, and immediately thereafter, the whole wafer 
is covered with polysilicon [link]. 


Polysilicon deposition 
over the gate oxide. 


The polysilicon is then patterned to form the two regions which will be our 
gates. The wafer is covered once again with photoresist. The resist is 
removed over the region that will be the n-channel device, but is left 
covering the p-channel device. A little area near the edge of the n-tank is 
also uncovered [link]. This will allow us to add some additional phosphorus 
into the n-well, so that we can make a contact there, so that the n-well can 
be connected to Vag. 


Preparing for NMOS 
channel/drain implant. 


Back into the implanter we go, this time exposing the wafer to phosphorus. 
The poly gate, the FOX and the photoresist all block phosphorus from 
getting into the wafer, so we make two n-type regions in the p-type 
substrate, and we have made our n-channel MOS source/drain regions. We 
also add phosphorous to the Vag contact region in the n-well so as the make 


sure we get good contact performance there [link]. 
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Phosphorus S/D implant. 


The formation of the source and drain were performed with a self-aligning 
technology. This means that we used the gate structure itself to define 
where the two inside edges of the source and drain would be for the 
MOSFET. If we had made the source/drain regions before we defined the 
gate, and then tried to line the gate up right over the space between them, 
we might have gotten something that looks like what is shown in [link]. 
What's going to be the problem with this transistor? Obviously, if the gate 


does not extend all the way to both the source and the drain, then the 
channel will not either, and the transistor will never turn on! We could try 
making the gate wider, to ensure that it will overlap both active areas, even 
if it is slightly misaligned, but then you get a lot of extraneous fringing 
capacitance which will significantly slow down the speed of operation of 
the transistor [link]. This is bad! The development of the self-aligned gate 
technique was one of the big breakthroughs which has propelled us into the 
VLSI and ULSI era. 
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A representation of a 
misaligned gate. 
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A representation of a 
wide gate. 


We pull the wafer out of the implanter, and strip off the photoresist. This is 
sometimes difficult, because the act of ion implantation can "bake" the 
photoresist into a very tough film. Sometimes an rf discharge in an O> 
atmosphere is used to "ash" the photoresist, and literally burn it off the 
wafer! We now apply some more PR, and this time pattern to have the moat 
area, and a substrate contact exposed, for a boron p* implant. This is shown 
in [Link]. 
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p-type source 


Boron p-channel S/D 
implant. 


We are almost done. The next thing we do is remove all the photoresist, and 
grow one more layer of oxide, which covers everything, as shown in [link]. 
We put photoresist over the whole wafer again, and pattern for contact holes 
to go through the oxide. We will put contacts for the two drains, and for 
each of the sources, make sure that the holes are big enough to also allow us 
to connect the source contact to either the p-substrate or the n-moat as is 


appropriate [link]. 


Final oxide growth. 


After the contact holes 
are etched. 


Applying Metallization by Sputtering 


Note:This module is adapted from the Connexions module entitled 
Applying Metal/Sputtering by Bill Wilson. 


We now put the wafer in a sputter deposition system. In the sputter system, 
we Coat the entire surface of the wafer with a conductor. An aluminum- 
silicon alloy is usually used, although other metals are employed as well. 


A sputtering system is shown schematically in [link]. A sputtering system is 
a vacuum chamber, which after it is pumped out, is re-filled with a low- 
pressure argon gas. A high voltage ionizes the gas, and creates what is 
known as the Crookes dark space near the cathode, which in our case, 
consists of a metal target made out of the metal we want to deposit. Almost 
all of the potential of the high-voltage supply appears across the dark space. 
The glow discharge consists of argon ions and electrons which have been 
stripped off of them. Since there are about equal number of ions and 
electrons, the net charge density is about zero, and hence by Gauss' law, so 
is the field. 
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A schematic representation of a sputtering 
apparatus. 


The electric field accelerates the argon atoms which slam into the aluminum 
target. There is an exchange of momentum, and an aluminum atom is 
ejected from the target ([link]) and heads to the silicon wafer, where it 


sticks, and builds up a metal film [link]. 
Aluminum Target 


ELECTRIC FIELD 


The sputtering mechanism. 
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Wafer coated with metal. 


If you look at [link], you will note that we have seemingly done something 
pretty stupid. We have wired all of the elements of our CMOS inverter 
together; but all is not lost. We can do one more photolithographic step, and 
pattern and etch the aluminum, so we only have it where we need it. This is 
shown in [link]. 


After interconnect patterning. 


Molecular Beam Epitaxy 


Note: This module was developed as part of the Rice University course 
CHEM-496: Chemistry of Electronic Materials. This module was prepared 
with the assistance of Sarah Westcott. 


Introduction 


In the process of epitaxy, a thin layer of material is grown on a substrate. 
With respect to crystal growth it applies to the process of growing thin 
crystalline layers on a crystal substrate. In epitaxial growth, there is a 
precise crystal orientation of the film in relation to the substrate. For 
electronic devices, the substrate is a single crystal (usually Si or GaAs) and 
therefore so is the epitaxial layer (epilayer). In the most basic form of 
molecular beam epitaxy (MBE), the substrate is placed in ultra high 
vacuum (UHV) and the source materials for the film are evaporated from 
elemental sources. The evaporated molecules or atoms flow as a beam, 
striking the substrate, where they are adsorbed on the surface. Once on the 
surface, the atoms move by surface diffusion until they reach a 
thermodynamically favorable location to bond to the substrate. Molecules 
will dissociate to atomic form during diffusion or at a favorable site. [link] 
illustrates the processes that can occur on the surface. Because the atoms 
require time for surface diffusion, the quality of the film will be better with 
slower growth. Typically growth rates of about 1 monolayer per second 
provide sufficiently high quality. 
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Schematic illustration of processes on 
growing surface during MBE. Adsorption of 
atoms on the surface, surface diffusion of 
atoms, formation of crystalline lattice, 
desorption of particles from the surface. 


A typical MBE chamber is shown in [link]. The substrate is chemically 
washed and then put into a loading chamber where it is further cleaned 
using argon ion bombardment followed by annealing. This removes the top 
layers of the substrate, which is usually an undesired oxide which grew in 
air and contains impurities. The annealing heals any damage caused by the 
bombardment. The substrate then enters the growth chamber via the sample 
exchange load lock. It is secured on a molybdenum holder either 
mechanically or with melted indium or gallium which hold the substrate by 
surface tension. 
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The MBE growth chamber. 


Each effusion cell (see [link]) is a source of one element in the film. The 
effusion cell, also called a Knudsen cell, contains the elemental form in 
very high purity (greater than 99.99999% for Ga and As). The cell is heated 
to encourage evaporation. For GaAs growth, the temperature is typically 
controlled for a vapor pressure of 10° to 10°? Torr inside the effusion cell, 
which results in a transport of about 10!° molecules/cm? to the substrate 
when the shutter for that cell is opened. The shape and size of the opening 
in the cell is optimized for an even distribution of particles on the substrate. 
Due to the relatively low concentration of molecules, they typically do not 
interact with other molecules in the beam during the 5 - 30 cm journey to 
the substrate. The substrate is usually rotated, at a few rpm, to further even 
the distribution. 


Because MBE takes place in UHV and has relatively low pressure of 
residual gas at the surface, analysis techniques such as reflection high 
energy diffraction and ellipsometry can be used during growth, both to 
study and control the growth process. The UHV environment also allows 
pre or post growth analysis techniques such as Auger spectroscopy. 


Elemental and molecular sources 


The effusion cell is used for the majority of MBE growth. All materials 
used in the cell are carefully chosen to be noninteracting with the element 
being evaporated. For example, the crucible is pyrolitic boron nitride. 
However, it has disadvantages, such as: 


e The evaporated species may be molecular, rather than monomeric, 
which will require further dissocation at the surface. 

e When the shutter is opened, the heat loss from the cell results in a 
transient in the beam flux which last for several minutes and cause 
variations of up to 50%. 

e The growth chamber must be opened up to replace the solid sources. 


Cracker cells are used to improve the ratio of monomeric to molecular (or at 
least dimeric to tetrameric) particles from the source. The cracker cell, 
placed so that the beam passes through it after the effusion cell, is 
maintained at a high temperature (and sometimes high pressure) to 
encourage dissociation. The dissociation process generally requires a 
catalyst and the best catalysts for a given species have been studied. 


Some elements, such as silicon, have low enough vapor pressure that more 
direct heating techniques such as electron bombardment or laser radiation 
heating are used. The electron beam is bent using electromagnetic focusing 
to prevent any impurities in the electron source from contaminating the 
silicon to be used in MBE. Because the heat is concentrated on the surface 
to be evaporated, interactions with and contamination from the crucible 
walls is reduced. In addition, this design does not require a shutter, so there 
is no problem with transients. Modulation of the beam can produce very 
sharp interfaces on the substrate. In laser radiation heating, the electron 
beam is replaced by a laser beam. The advantages of localized heating and 
rapid modulation are also maintained without having to worry about 
contamination from the electron source or stray electrons. 


Some of the II-VI (12-16) compounds have such high vapor pressure that a 
Knudson cell cannot be used. For example, the mercury source must be 
kept cooler than the substrate to keep the vapor pressure low enough to be 


feasible. The Hg source must also be sealed off from the growth chamber to 
allow the chamber to be pumped down. 


Two other methods of obtaining the elements for use in epitaxy are gas- 
source epitaxy and chemical beam epitaxy (CBE). Both of these methods 
use gas sources, but they are distinguished by the use of elemental beams in 
gas source epitaxy, while organometallic beams are used in CBE. For the 
example of III-V (13-15) semiconductors, in gas epitaxy, the group III 
material may come from an effusion cell while the group V material is the 
hydride, such as AsH3 or PH3, which is cracked before entering the growth 
chamber. In CBE, the group V material is an organometallic, such as 
triethylgallium [Ga(C>Hs)3] or trimethylaluminum [AI(CH3)3], which 
adsorbs on the surface, where it dissociates. 


The gas sources have several advantages. Gas lines can be run into the 
chamber, which allows the supply to be replenished without opening the 
chamber. When making alloys, such as Al,Ga,_,As, the gases can be 
premixed for the correct stochiometry or even have their composition 
gradually changed for making graded structures. For abrupt structures, it is 
necessary to be able to switch the gas lines with speeds of 1 second or less. 
However, the gas lines increase the complexity of the process and can be 
hard on the pumping system. 


Substrate choice and preparation 


Materials can be grown on substrates of different structure, orientation, and 
chemistry. In deciding which materials can be grown on a particular 
substrate, a primary consideration was expected to be lattice mismatch, i.e., 
differences in spacing between atoms. However, while lattice mismatch can 
cause strain in the grown layer, considerable accommodation between 
materials of different sizes can take place during growth. A greater source 
of strain can be differences in thermal expansion characteristics because the 
layer is grown at high temperature. On cooling to room temperature, 
dislocation defects can be formed at the interface or in severe cases, the 
device may break. Chemical considerations, such as whether the layer's 
elements will dissolve in the substrate or form compounds with the 
substrate, must also be considered. 


Different orientations of the substrate can also affect growth. Close-packed 
planes have the lowest surface energy, which allows atoms to desorb from 
the surface, resulting in slower growth rates. Growth is favored where 
bonds can be made in several directions at the same time. Therefore, the 
substrate is often cut off-axis by a 2 - 4° to provide a rougher growth 
surface. For compound semiconductors, some orientations result in the 
number of loose bonds changing between layers. This results in changes of 
surface energy with each layer, which slows growth down. 


The greatest cause of defects in the epitaxial layer is defects on the 
substrate's surface. In general, any dislocations on the substrate are 
replicated or even multiplied in the epitaxial growth, which is what makes 
the cleaning of the substrate so important. 


Materials grown 


MBE is commercially used primarily for GaAs devices. This is partly 
because the high speed microwave devices made from GaAs required the 
superior electrical quality of epitaxial layers. Taking place at lower 
temperature and under better controlled conditions, MBE generally results 
in layers of better quality than melt-grown. 


From solid Ga and As sources, elemental Ga and tetrameric As, are 
evaporated. For a GaAs substrate, the Ga flux has a sticking coefficient very 
close to 1 (almost certain to adsorb). The As is much less likely to adsorb, 
so an excess is usually supplied. Cracker cells are often used on the As, in 
order to obtain As» instead, which results in faster growth and more 
efficient utilization of the source beam. 


For nominally undoped GaAs grown by MBE, the residual impurities are 
usually carbon, from substrate contamination and residual gas after the 
growth chamber is pumped down, and sulphur, from the As source. The 
most common surface defects are "oval" defects, which seem to form when 
Ga manages to form metallic droplets during the growth process, which can 
particularly occur if the substrate was not cleaned properly. These defects 
can also be reduced by carefully controlling the Ga flux. 


During MBE growth, dopants can be introduced by having a separate 
effusion cell or gas source for each dopant. To achieve a desired dopant 
concentration in the film, not only must the rate of dopants striking the 
substrate be controlled, but the characteristics of how the dopant behaves on 
the surface must be known. Low-vapor pressure dopants tend to desorb 
from the surface and their behavior is very temperature dependent and so 
are avoided when possible. Slow diffusing dopants adsorb to surface sites 
and are eventually covered as more GaAs is grown. Their incorporation 
depends linearly on the partial pressure of the dopant present in the growth 
chamber. This is the behavior exhibited by most n-type dopants in GaAs 
and most dopants of both types in Si. If the dopant, like most p-type GaAs 
dopants, is able to diffuse through the surface of the substrate into the 
crystal below, then there will be higher incorporation efficiency, which will 
depend on the square root of the dopant partial pressure for reasonable 
concentrations. Due to increasing lattice strain, all dopants will saturate at 
very high concentrations. They may also tend to form clusters. Dopant 
behavior depends on many factors and is actively studied. 


The growth of GaAs epitaxial layers on silicon substrates has also been 
investigated. Silicon substrates are grown in larger wafers, have better 
thermal conductivity allowing more devices/chip to be grown on them, and 
are cheaper. However, because Si is nonpolar and GaAs is polar, the GaAs 
tends to form islands on the surface with different phase (what should be a 
Ga site based on a neighboring domain's pattern will actually be an As site). 
There is also a fairly large lattice mismatch, leading to may dislocations. 
However, FETs, LEDs, and lasers have all been made in laboratories. 


Many devices require abrupt junctions between layers of different materials. 
One group, studying how to make high quality, abrupt GaAs and AlAs 
layers, found that rapid movement of the Ga or Al on the surface was 
required. This migration was enhanced at high temperatures, but 
unfortunately, diffusion into the substrate also increased. However, they 
also discovered that migration of Ga or Al increased if the As supply was 
turned off. By alternating the Ga and As supplies, the Ga was able to reach 
the substrate and migrate to provide more even monolayer coverage before 
the As atoms arrived to react. 


Besides GaAs, most other III-V semiconductors have also been grown 
using MBE. Structures involving very thin layers (only a few atomic layers 
thick), often called superlattices or strained superlattices if there is a large 
lattice mismatch, are routinely grown. Because different materials have 
different energy levels for electrons and holes, it is possible to trap carriers 
in one of these thin layers, forming a quantum well. This type of 
confinement structure is particularly popular for LEDs or lasers, including 
blue light lasers. The strained superlattice structure actually shifts and splits 
the energy levels of the materials in some cases making devices possible for 
such applications as infrared light detection, which requires very small band 


gaps. 


Thin films of many other materials have also been grown using MBE 
methods. Silicon technology has cheaper methods of growth and so Si 
layers are not very popular. However, possible devices made of Si-Ge alloys 
have been grown. The II-VI compounds, have also been grown. Magnetic 
materials, such as Co-Pt and Fe-Pt alloys, have been grown in the hopes of 
providing better magnetic storage. 


Analysis techniques 


The most popular in-situ analysis technique for MBE-grown layers is 
reflection high energy diffraction (RHEED), see [link]. Electrons of energy 
5 - 40 keV are directed towards the sample. They reflect from the surface at 
a very small angle (less than 3°) and are directed onto a screen. These 
electrons interact with only the top few atomic layers and thus provide 
information about the surface. [link] shows a typical pattern on the screen 
for electrons reflected from a smooth surface, in which constructive 
interference between some of the electrons reflected from the lattice 
structure results in lines. If the surface is rough, spots will appear on the 
screen, also. By looking at the total intensity of the reflected electron 
pattern, an idea of the number of monolayers deposited and how epilayers 
grow can be obtained. The island-type growth shown in this figure is an 
area of intense interest. These oscillations in intensity are gradually damped 
as more layers are grown, because the overall roughness of the surface 
increases. 


Incident 
electron beam 


Schematic illustrating the formation of a 
RHEED pattern. 


RHEED diffraction pattern of a GaAs surface. Adapted 
from images by the MBE Laboratory in the Institute of 
Physics of the ASCR 
(http://www. fzu.cz/departments/surfaces/mbe/index.html) 


Phase locked epitaxy takes advantage of the patterns of the oscillations to 
grow very abrupt layers. By sending the oscillation information to a 
computer, it can decide when to open or close the shutters of the effusion 
cell based on the location in the oscillation cycle. This technique self- 
adjusts for fluctuations in beam flux when the shutters are opened and can 
grow very abrupt layers. 


Another analysis technique that can be used to study surface smoothness 
during growth is ellipsometry. Polarized laser light is reflected from the 
surface at a small angle. The polarization of the light changes, depending on 
the roughness of the surface. 


Improved growth characteristics also require that the actual flux from the 
sources is measured. This is typically done with an ion gauge flux monitor, 
which is either used to measure residual beam that misses the substrate or is 
moved into the beam path for calibration when a new source is used. 
Because of the importance of clean substrate surfaces for low-defect 
growth, Auger spectroscopy is used following cleaning by sputtering. 
Auger spectroscopy takes place by ionizing an inner shell electron from an 
atom. When an outer shell electron then deexcites to the inner shell, the 
energy released can prompt the emission of another outer shell electron. 
The energy at which this occurs is characteristic of the atom involved and 
the signal can be used to detect impurities as small as 0.1%. 
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Atomic Layer Deposition 


This module was developed as part of the Rice University course CHEM- 
496: Chemistry of Electronic Materials. This module was prepared with the 
assistance of Julie A. Francis. 


Introduction 


The growth of thin films has had dramatic impact on technological 
progress. Because of the various properties and functions of these films, 
their applications are limitless especially in microelectronics. These layers 
can act as superconductors, semiconductors, conductors, insulators, 
dielectric, or ferroelectrics. In semiconductor devices, these layers can act 
as active layers and dielectric, conducting, or ion barrier layers. Depending 
on the type of film material and its applications, various deposition 
techniques may be employed. For gas-phase deposition, these include 
vacuum evaporation, reactive sputtering, chemical vapor deposition (CVD), 
especially metal organic CVD (MOCVD), and molecular beam epitaxy 
(MBE). Atomic layer deposition (ALD), originally called atomic layer 
epitaxy (ALE), was first reported by Suntola et al. in 1980 for the growth of 
zinc sulfide thin films to fabricate electroluminescent flat panel displays. 


ALD refers to the method whereby film growth occurs by exposing the 
substrate to its starting materials alternately. It should be noted that ALE is 
actually a sub-set of ALD, in which the grown film is epitaxial to the 
substrate; however, the terms are often used interchangeably. Although both 
ALD and CVD use chemical (molecular) precursors, the difference between 
the techniques is that the former uses self limiting chemical reactions to 
control in a very accurate way the thickness and composition of the film 
deposited. In this regard ALD can be considered as taking the best of CVD 
(the use of molecular precursors and atmospheric or low pressure) and 
MBE (atom-by-atom growth and a high control over film thickness) and 
combining them in single method. A selection of materials deposited by 
ALD is given in [link]. 


Compound class 


II-VI compounds 


II-VI based thin-film 
electroluminescent 
(TFEL) phosphors 


III-V compounds 


Semiconductors/dielectric 
nitrides 


Metallic nitrides 
Dielectric oxides 
Transparent conductor 
oxides 

Semiconductor oxides 


Superconductor oxides 


Fluorides 


Examples 

ZnS, ZnSe, ZnTe, ZnS;_,Se,, CaS, 
SrS, BaS, SrS,;_,Se,, CdS, CdTe, 
MntTe, HgTe, Hg,_,Cd,Te, 
Cd,_,Mn,Te 

ZnS:M (M = Mn, Tb, Tm), CaS:M (M 
= Eu, Ce, Tb, Pb), SrS:M (M = Ce, 
Tb, Pb, Mn, Cu) 


GaAs, AlAs, AIP, InP, GaP, InAs, 
Al,Ga,-,As, Ga,Inj_,As, Ga,In,_,P 


AIN, GaN, InN, SiN, 


TiN, TaN, Ta3Ns, NbN, MoN 
AloO3, TiO», ZrOp, HfO,, TajOs, 
Nb Os, Y2O3, MgO, CeOsz, SiO», 
La,O3, SrTiO3, BaTiO; 


InyO3, In,O3:Sn, InpO3:F, IngO3:Zr, 
SnO>, SnO>:Sb, ZnO, 


ZnO:Al, GasO3, NiO, CoO, 
Y BayCu307_, 


CaF), SrF5, ZnF» 


Examples of thin film materials deposited by ALD including films 
deposited in epitaxial, polycrystalline or amorphous form. Adapted from M. 
Ritala and M. Leskel, Nanotechnology, 1999, 10, 19. 


How ALD works 


The premise behind the ALD process is a simple one. The substrate 
(amorphous or crystalline) is exposed to the first gaseous precursor 
molecule (elemental vapor or volatile compound of the element) in excess 
and the temperature and gas flow is adjusted so that only one monolayer of 
the reactant is chemisorbed onto the surface ({link]a). The excess of the 
reactant, which is in the gas phase or physisorbed on the surface, is then 
purged out of the chamber with an inert gas pulse before exposing the 
substrate to the other reactant ({link]b). The second reactant then 
chemisorbs and undergoes an exchange reaction with the first reactant on 
the substrate surface ((link]c). This results in the formation of a solid 
molecular film and a gaseous side product that may then be removed with 
an inert gas pulse ({link]d). 


(a) Ist precursor pulse 


f 


Substrate 


(d) Purge (c) 2nd precursor pulse 
8 Pp Pp 


Substrate 


Schematic representation of an ALD process. 


The deposition may be defined as self-limiting since one, and only one, 
monolayer of the reactant species remains on the surface after each 
exposure. In this case, one complete cycle results in the deposition of one 
monolayer of the compound on the substrate. Repeating this cycle leads to a 
controlled layer-by-layer growth. Thus the film thickness is controlled by 
the number of precursor cycles rather than the deposition time, as is the 
case for a CVD processes. This self-limiting behavior is the fundamental 
aspect of ALD and understanding the underlying mechanism is necessary 
for the future exploitation of ALD. 


One basic condition for a successful ALD process is that the binding energy 
of a monolayer chemisorbed on a surface is higher than the binding energy 
of subsequent layers on top of the formed layer; the temperature of the 
reaction controls this. The temperature must be kept low enough to keep the 
monolayer on the surface until the reaction with the second reactant occurs, 
but high enough to re-evaporate or break the chemisorption bond. The 
control of a monolayer can further be influenced with the input of extra 
energy such as UV irradiation or laser beams. The greater the difference 
between the bond energy of a monolayer and the bond energies of the 
subsequent layers, the better the self-controlling characteristics of the 
process. 


Basically, the ALD technique depends on the difference between 
chemisorption and physisorption. Physisorption involves the weak van der 
Waal's forces, whereas chemisorption involves the formation of relatively 
strong chemical bonds and requires some activation energy, therefore it may 
be slow and not always reversible. Above certain temperatures 
chemisorption dominates and it is at this temperature ALD operates best. 
Also, chemisorption is the reason that the process is self-controlling and 
insensitive to pressure and substrate changes because only one atomic or 
molecular layer can adsorb at the same time. 


Equipment for the ALD process 


Equipment used in the ALD process may be classified in terms of their 
working pressure (vacuum, low pressure, atmospheric pressure), method of 


pulsing the precursors (moving substrate or valve sources) or according to 
the types of sources. Several system types are discussed. 


In a typical moving substrate ALD growth system ({link]) the substrate, 
located in the recess part of the susceptor, is continuously rotated and cuts 
through streams of the gaseous precursors, in this case, trimethylgallium 
[TMG, Ga(CH3)3] and arsine (AsH3). These gaseous precursors are 
introduced through separate lines and the gases come in contact with the 
substrate only when it revolves under the inlet tube. This cycle is repeated 
until the required thickness of GaAs is achieved. The exposure time to each 
of the gas streams is about 0.3 s. 


windows 


Substrate in 


recess 
Rotating part 
Quartz tube 


reactor 


Exhaust 


A typical moving substrate ALD growth system used 
to grow GaAs films. Adapted from M. A. Tischler 
and S. M. Bedair, Appl. Phys. Lett., 1986, 48, 1681. 


ALD may be carried out in a vacuum system using an ultra-high vacuum 
with a movable substrate holder and gaseous valving. In this manner it may 
be also equipped with an in-situ LEED system for the direct observation of 
surface atom configurations and other systems such as XPS, UPS, and AES 
for surface analysis. 


A lateral flow system may also be employed for successful ALE deposition. 
This uses an inert gas flow for several functions; it transports the reactants, 
it prevents pump oil from entering the reaction zone, it valves the sources 
and it purges the deposition site between pulses. Inert gas valving has many 
advantages as it can be used at ultra high temperatures where mechanical 
valves may fail and it does not corrode as mechanical valves would in the 
presence of halides. This method is based on the fact that as the inert gas is 
flowing through the feeding tube from the source to the reaction chamber, it 
blocks the flow of the sources. Although in this system the front end of the 
substrate receives a higher flux density than the down-stream end, a 
uniform growth rate occurs as long as the saturation layer of the 
monoformation predominates. This lateral flow system effectively utilizes 
the saturation mechanism of a monolayer formation obtained in ALE. 
Depending on the properties of the precursors used, and on the growth 
temperature, various growth systems may be used for ALE. 


Requirements for ALD growth 


Several parameters must be taken into account in order to assure successful 
ALD growth. These include the physical and chemical properties of the 
source materials, their pulsing into the reactor, their interaction with the 
substrate and each other, and the thermodynamics and volatility of the film 
itself. 


Source molecules used in ALD can be either elemental or an inorganic, 
organic, or organometallic compound. The chemical nature of the precursor 
is insignificant as long as it possesses the following properties. It must be a 
gas or must volatilize at a reasonable temperature producing sufficient 
vapor pressure. The vapor pressure must be high enough to fill the substrate 
area so that the monolayer chemisorption can occur in a reasonable length 
of time. Note that prolonged exposure to the substrate can cause the 


precursor to condense on the surface hindering the growth. Chemical 
interaction between the two precursors prior to chemisorption on the 
surface is also undesired. This may be overcome by purging the surface 
with an inert gas or hydrogen between the pulses. The inert gas not only 
separates the reactant pulses but also cleans out the reaction area by 
removing excess molecules. Also, the source molecules should not 
decompose on the substrate instead of chemisorbing. The decomposition of 
the precursor leads to uncontrolled growth of the film and this defeats the 
purpose of ALD as it no longer is self-controlled, layer-by-layer growth and 
the quality of the film plummets. 


In general, temperature remains the most important parameter in the ALD 
process. There exists a processing window for ideal growth of monolayers. 
The temperature behavior of the rate of growth in monolayer units per cycle 
gives a first indication of the limiting mechanisms of an ALD process. If 
the temperature falls too low, the reactant may condense or the energy of 
activation required for the surface reaction may not be attained. If the 
temperature is too high, then the precursor may decompose or the 
monolayer may evaporate resulting in poor ALD growth. In the appropriate 
temperature window, full monolayer saturation occurs meaning that all 
bonding sites are occupied and a growth rate of one lattice unit per cycle is 
observed. If the saturation density is below one, several factors may 
contribute to this. These include an oversized reactant molecule, surface 
reconstruction, or the bond strength of an adsorbed surface atom is higher 
when the neighboring sites are unoccupied. Then the lower saturation 
density may be thermodynamically favored. If the saturation density is 
above one, then the undecomposed precursor molecules form the 
monolayer. Generally, ideal growth occurs when the temperature is set 
where the saturation density is one. 


Advantages of ALD 


Atomic layer deposition provides an easy way to produce uniform, 
crystalline, high quality thin films. It has primarily been directed towards 
epitaxial growth of III-V (13-15) and II-V (12-16) compounds, especially to 
layered structures such as superlattices and superalloys. This application is 
due to the greatest advantage of this method, it is controllable to an 


accuracy of a single atomic layer because of saturated surface reactions. 
Not only that, but it produces epitaxial layers that are uniform over large 
areas, even on non-planar surfaces, at temperatures lower than those used in 
conventional epitaxial growth. 


Another advantage to this method that may be most important for future 
applications, is the versatility associated with the process. It is possible to 
grow different thin films by choosing suitable starting materials among the 
thousands of available chemical compounds. Provided that the 
thermodynamics are favorable, the adjustment of the reaction conditions is 
a relatively easy task because the process is insensitive to small changes in 
temperature and pressure due to its relatively large processing window. 
There are also no limits in principle to the size and shape of the substrates. 


One advantage that is resultant from the self-limiting growth mechanism is 
that the final thickness of the film is dependent only upon the number of 
deposition cycles and the lattice constant of the material, and can be 
reproduced and controlled. The thickness is independent of the partial 
pressures of the precursors and growth temperature. Under ideal conditions, 
the uniformity and the reproducibility of the films are excellent. ALE also 
has the potential to grow very abrupt heterostructures and very thin layers 
and these properties are in demand for some applications such as 
superlattices and quantum wells. 
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Chemical Vapor Deposition 


Note:This module was developed as part of the Rice University course 
CHEM-496: Chemistry of Electronic Materials. This module was prepared 
with the assistance of Scott Stokes. 


Introduction 


Chemical vapor deposition (CVD) is a deposition process where chemical 
precursors are transported in the vapor phase to decompose on a heated 
substrate to form a film. The films may be epitaxial, polycrystalline or 
amorphous depending on the materials and reactor conditions. CVD has 
become the major method of film deposition for the semiconductor industry 
due to its high throughput, high purity, and low cost of operation. CVD is 
also commonly used in optoelectronics applications, optical coatings, and 
coatings of wear resistant parts. 


CVD has many advantages over physical vapor deposition (PVD) processes 
such as molecular beam evaporation and sputtering. Firstly, the pressures 
used in CVD allow coating of three dimensional structures with large aspect 
ratios. Since evaporation processes are very directional, PVD processes are 
typically line of sight depositions that may not give complete coverage due 
to shadowing from tall structures. Secondly, high precursor flow rates in 
CVD give deposition rates several times higher than PVD. Also, the CVD 
reactor is relatively simple and can be scaled to fit several substrates. Ultra- 
high vacuum is not needed for CVD and changes or additions of precursors 
is an easy task. Furthermore, varying evaporation rates make stoichiometry 
hard to control in physical deposition. While for CVD stoichiometry is 
more easily controlled by monitoring flow rates of precursors. Other 
advantages of CVD include growth of high purity films and the ability to 
fabricate abrupt junctions. 


There are, however, some disadvantages of CVD that make PVD more 
attractive for some applications. High deposition temperatures for some 


CVD processes (often greater than 600 °C) are often unsuitable for 
structures already fabricated on substrates. Although with some materials, 
use of plasma-enhanced CVD or metal-organic precursors may reduce 
deposition temperatures. Another disadvantage is that CVD precursors are 
often hazardous or toxic and the by-products of these precursors may also 
be toxic. Therefore extra steps have to be taken in the handling of the 
precursors and in the treatment of the reactor exhaust. Also, many 
precursors for CVD, especially the metal-organics, are relatively expensive. 
Finally, the CVD process contains a large number of parameters that must 
be accurately and reproducibly optimized to produce good films. 


Kinetics of CVD 


A normal CVD process involves complex flow dynamics since gases are 
flowing into the reactor, reacting, and then by-products are exhausted out of 
the reactor. The sequence of events during a CVD reaction are shown in 
[link] and as follows: 


1. Precursor gases input into the chamber by pressurized gas lines. 

2. Mass transport of precursors from the main flow region to the substrate 
through the boundary layer ([Link]a); 

. Adsorption of precursors on the substrate (normally heated) ({link]b). 

. Chemical reaction on the surface ([link]c) 

. Atoms diffuse on the surface to growth sites. 

. Desorption of by-products of the reactions ({link]d). 

. Mass transport of by-products to the main flow region ([link]e). 
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Sequence of events during CVD: (a) diffusion of reactants through 
boundary layer, (b) adsorption of reactants on substrate, (c) 
chemical reaction takes place, (d) desorption of adsorbed species, 
and (e) diffusion out of by-products through boundary layer. 
Adapted from H. O. Pierson, Handbook of Chemical Vapor 
Deposition, Noyes Publications, Park Ridge (1992). 


The boundary layer 


Gas flow in a CVD reactor is generally laminar, although in some cases 
heating of the chamber walls will create convection currents. The complete 
problem of gas flow through the system is too complex to be described 
here; however, assuming we have laminar flow (often a safe assumption) 
the gas velocity at the chamber walls will be zero. Between the wall (zero 
velocity) and the bulk gas velocity there is a boundary layer. The boundary 
layer thickness increases with lowered gas velocity and the distance from 
the tube inlet ({link]). Reactant gases flowing in the bulk must diffuse 
through the boundary layer to reach the substrate surface. Often, the 


susceptor is tilted to partially compensate for the increasing boundary-layer 
thickness and concentration profile. 


Reactor cell 
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Development of boundary layer in a 
horizontal reactor. Adapted from G. B. 
Stringfellow, Organometallic Vapor-Phase 
Epitaxy: Theory and Practice, Academic 
Press, New York (1994). 


Rate limiting steps 


During CVD the growth rate of the film is limited by either surface reaction 
kinetics, mass transport (diffusion) of precursors to the substrate, or the feed 
rate of the precursors. 


Surface reaction controls the rate when growth occurs at low temperatures 
(where the reaction occurs slowly) and also dominates at low pressures 
(where the boundary layer is thin and reactants easily diffuse to the 
surface), see [link]. Since reactants easily diffuse through the boundary 
layer, the amount of reactant at the surface is independent of reactor 
pressure. Therefore, it is the reactions and motions of the precursors 
adsorbed on the surface which will determine the overall growth rate of the 
film. A sign of surface reaction limited growth would be dependence of the 
growth rate on substrate orientation, since the orientation would certainly 
not affect the thermodynamics or mass transport of the system. 
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Surface reaction limited growth in CVD. Adapted from 
H. O. Pierson, Handbook of Chemical Vapor Deposition, 
Noyes Publications, Park Ridge (1992). 


A deposition limited by mass transport is controlled by the diffusion of 
reactants through the boundary layer and diffusion of by-products out of the 
boundary layer. Mass transport limits reactions when the temperature and 
pressure are high. These conditions increases the thickness of the boundary 
layer and make it harder for gases to diffuse through ([link]). In addition, 
decomposition of the reactants is typically quicker since the substrate is at a 
higher temperature. When mass transport limits the growth, either 
increasing the gas velocity or rotating the substrate during growth will 


decrease the boundary layer and increase the growth rate. 
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Mass transport limited growth in CVD. Adapted from H. 


O. Pierson, Handbook of Chemical Vapor Deposition, 
Noyes Publications, Park Ridge (1992). 


Feed rate limits the deposition when nearly all the reactant is consumed in 
the chamber. The feed rate is more important for a hot wall reactor since the 
heated walls will decompose a large amount of the precursor. Cold wall 
reactors tend to have higher deposition rates since the reactants are not 
depleted by the walls. 


A plot of growth rate versus temperature, known as an Arrhenius plot, can 
be used to determine the rate limiting step of a reaction ([link]). Mass 
transport limits reactions at high temperatures such that growth rate 
increases with partial pressures of reactants, but is constant with 
temperature. Surface reaction kinetics dominates at low temperatures where 
the growth rate increases with temperature, but is constant with pressures of 
reactants. Feed rate limited reactions are independent of temperature, since 
it is the rate of gas delivery that is limiting the reaction. The Arrhenius plot 
will show where the transition between the mass transport limited and the 
surface kinetics limited growth occurs in the temperature regime. 
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Dependence of CVD deposition rate on temperature. 
Adapted from J. G. Eden, in Thin Film Processes IT, Eds. 
J. L. Vossen and W. Kern, Academic Press, New York 
(1991). 


Increases in reactant concentrations will to a point increase the deposition 
rate. However, at very high reactant concentrations, gas phase nucleation 
will occur and the growth rate will drop ([link]). Slow deposition in a CVD 
reactor can often be attributed to either gas phase nucleation, precursor 
depletion due to hot walls, thick boundary layer formation, low 
temperature, or low precursor vapor pressure. 
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Demonstration of deposition rate on reactant 
concentration for CVD deposition. Adapted from J. G. 
Eden, in Thin Film Processes II, Eds. J. L. Vossen and 

W. Kern, Academic Press, New York (1991). 


CVD systems 


Precursor delivery 


Flow of reactants into the reactor must be closely monitored to control 
stoichiometry and growth rate. Precursor delivery is very important since in 
many cases the flow rate can limit the deposition. For low vapor pressure 
solids, a carrier gas is passed over or through a bed of the heated solid to 


transport the vapor into the reactor. Gas flow lines are usually heated to 
reduce condensation of the vapor in the flow lines. In the case of gas 
precursors, mass flowmeters easily gauge and control the flow rates. Liquid 
precursors are normally heated in a bubbler to achieve a desired vapor 
pressure ([link]). 
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Schematic representation of a bubbler for 
liquid precursors. 


An inert gas such as hydrogen is bubbled through the liquid and by 
calculating the vapor pressure of the reactant and monitoring the flow rate 
of the hydrogen, the flow rate of the precursor is controlled by using [link], 
where Qyo is the flow rate of the metal-organic precursor, Qyp is the flow 
rate of hydrogen through the bubbler, Pyyg is the vapor pressure of the 
metal-organic at the bubbler temperature, and Pp is the pressure of the 
bubbler. 

Equation: 


Pao 


Q0= Was — 
" - Pg - Péo 


Another method of introducing liquid precursors involves flash 
vaporization where the liquid is passed into a flask heated above the boiling 


point of the liquid. The gas vapor is then passed through heated lines to the 
CVD chamber. Often, a carrier gas is added to provide a fixed flow rate into 
the reactor. This method of precursor introduction is useful when the 
precursor will decompose if heated over time. A similar technique called 
spray pyrolysis introduces the precursors in the form of aerosol droplets. 
The droplets evaporate in the chamber from the heated gas above the 
substrate or heated chamber walls ([link]). Then the reaction proceeds as a 
normal CVD process. 
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Schematic representation of a typical aerosol delivery 
system for CVD precursors. Adapted from T. T. Kodas 
and M. J. Hamton-Smith, The Chemistry of Metal CVD, 
VCH, New York (1994). 


Thermal CVD reactors 


In thermal CVD temperatures as high as 2000 °C may be needed to 
thermally decompose the precursors. Heating is normally accomplished by 
use of resistive heating, radio frequency (rf) induction heating, or radiant 
heating. There are two basic types of reactors for thermal CVD: the hot wall 
reactor and the cold wall reactor. 


A hot wall reactor is an isothermal furnace into which the substrates are 
placed. Hot wall reactors are typically very large and depositions are done 
on several substrates at once. Since the whole chamber is heated, precise 
temperature control can be achieved with correct furnace design. A 
disadvantage of the hot wall configuration is that deposition occurs on the 
walls of the chamber as well as on the substrate. As a consequence, hot wall 
reactors must be frequently cleaned to reduce flaking of particles from the 
walls which may contaminate the substrates. Furthermore, reactions in the 
heated gas and at the walls deplete the reactants and can result in feed rate 
limited growth. [link] shows a typical low pressure hot wall CVD reactor. 
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Schematic of a typical low pressure hot wall CVD 
reactor used in coating silicon substrates. Adapted from 
H. O. Pierson, Handbook of Chemical Vapor Deposition, 
Noyes Publications, Park Ridge (1992). 


In a cold wall reactor only the substrate is heated, usually by induction or 
radiant heating. Since most CVD reactions are endothermic, deposition is 
preferentially on the area of highest temperature. As a result, deposition is 
only on the substrate and the cooler reactor walls stay clean. Cold wall 
CVD has two main advantages over the hot wall configuration. First, 


particulate contamination is reduced since there are no deposits formed on 
the walls of the reactor. Second, since decomposition only occurs on the 
substrate there is no depletion of source gases due to reaction on the walls. 
However, hot wall reactors tend to have higher throughput since the designs 
more easily accommodate multiple wafer configurations. 


The final issue in design of a thermal CVD reactor is the operating pressure. 
The pressure of the reactor has a large effect on the rate limiting step of the 
deposition. Atmospheric pressure reactors have a large boundary layer 
({link]) and non-uniform diffusion of reactants through the boundary layer 
often results in non-uniform film compositions across the wafer. 
Conversely, low pressure reactors have a nearly non-existent boundary later 
and reactants easily diffuse to the substrate ((link]). However, the difficulty 
in maintaining a uniform temperature profile across the wafer can result in 
thickness non-uniformities since the deposition rate in low pressure reactors 
is strongly temperature dependent. Careful studies of the flow dynamics and 
temperature profiles of CVD reactors are always carried out in order to 
achieve uniform material depositions. 


Plasma-enhanced CVD 


Plasmas are generated for a variety of thin film processes including 
sputtering, etching, ashing, and plasma-enhanced CVD. Plasma-enhanced 
CVD (PECVD), sometimes called plasma-assisted (PAC VD), has the 
advantage that plasma activated reactions occur at much lower temperatures 
compared to those in thermal CVD. For example, the thermal CVD of 
silicon nitride occurs between 700 - 900 °C, the equivalent PECVD process 
is accomplished between 250 - 350 °C. 


A plasma is a partially ionized gas consisting of electrons and ions. Typical 
ionization fractions of 10° to 10°! are encountered in process reactors. 
Plasmas are electrically conductive with the primary charge carriers being 
the electrons. The light mass of the electron allows it to respond much more 
quickly to changes in the field than the heavier ions. Most plasmas used for 
PECVD are generated using a rf electric field. In the high frequency electric 
field, the light electrons are quickly accelerated by the field but do not 


increase the temperature of the plasma because of their low mass. The 
heavy ions cannot respond to the quick changes in direction and therefore 
their temperature stays low. Electron energies in the plasma have a 
Maxwellian distribution in the 0.1 — 20 eV range. These energies are 
sufficiently high to excite molecules or break chemical bonds in collisions 
between electrons and gas molecules. The high energy electrons 
inelastically collide with gas molecules resulting in excitation or ionization. 
The reactive species generated by the collisions do not have the energy 
barriers to reactions that the parent precursors do. Therefore, the reactive 
ions are able to form films at temperatures much lower than those required 
for thermal CVD. 


The general reaction sequence for PECVD is shown in [link]. In addition to 
the processes that occur in thermal CVD, reactive species resulting from 
electron dissociation of parent molecules also diffuse to the surface. The 
reactive species have lower activation energies for chemical reactions and 
usually have higher sticking coefficients to the substrate. Therefore, the 
PECVD reaction is dominated by the reactive species on the surface and not 
any of the the parent precursor molecules that also diffuse to the surface. 
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Reaction sequence in PECVD. Adapted from M. 
Konuma, Film Deposition by Plasma Techniques, 
Springer-Verlag, New York (1992). 


A basic PECVD reactor is shown in [link]. The wafer chuck acts as the 
lower electrode and is normally placed at ground potential. Gases are either 
introduced radially at the edges of the reactor and pumped out from the 
center, or gases can be introduced from the center and pumped at the edges 
as shown in [link]. The magnetic drive allows rotation of the wafers during 
processing to randomize substrate position. Some newer reactors introduce 
the gases through holes drilled in the upper electrode. This method of gas 
introduction gives a more uniform plasma distribution. 
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Schematic representation of a radial flow PECVD 
reactor. Adapted from H. O. Pierson, Handbook of 
Chemical Vapor Deposition, Noyes Publications, Park 
Ridge (1992). 


Plasma CVD has numerous advantages over thermal CVD. Obviously the 
reduced deposition temperature is a bonus for the semiconductor industry 
which must worry about dopant diffusion and metal interconnects melting 
at the temperatures required for thermal CVD. Also, the low pressures 
(between 0.1 - 10 Torr) required for sustaining a plasma result in surface 
kinetics controlling the reaction and therefore greater film uniformity. A 
disadvantage of plasma CVD is that it is often difficult to control 
stoichiometry due to variations in bond strengths of various precursors. For 
example, PECVD films of silicon nitride tend to be silicon rich because of 
the relative bond strength of N> relative to the Si-H bond. Additionally, 
some films may be easily damaged by ion bombardment from the plasma. 


Photochemical CVD 


Photochemical CVD uses the energy of photons to initiate the chemical 
reactions. Photodissociation of the chemical precursor involves the 
absorption of one or more photons resulting in the breaking of a chemical 
bond. The most common precursors for photo-assisted deposition are the 
hydrides, carbonyls, and the alkyls. The dissociation of dimethylzinc by 
[link], a photon creates a zinc radical and a methyl radical (‘CH3) that will 
react with hydrogen in the reactor to produce methane. 

Equation: 


Zn(CH;), + hv (64eV) + H, > Zn + 2CH, 


Like several metal-organics, dimethylzinc is dissociated by the absorption 
of only one UV photon. However, some precursors require absorption of 
more than one photon to completely dissociate. There are two basic 
configurations for photochemical CVD. The first method uses a laser 
primarily as a localized heat source. The second method uses high energy 
photons to decompose the reactants on or near the growth surface. 


In thermal laser CVD, sometimes referred to as laser pyrolysis, the laser is 
used to heat a substrate that absorbs the laser photons. Laser heating of 
substrates is a very localized process and deposition occurs selectively on 
the illuminated portions of the substrates. Except for the method of heating, 
laser CVD is identical to thermal CVD. The laser CVD method has the 
potential to be used for direct writing of features with relatively high 
resolution. The lateral extent of film growth when the substrate is 
illuminated by a laser is determined not only by the spot size of the laser, 
but by the thermal conductivity of the substrate. A variation of laser 
pyrolysis uses a laser to heat the gas molecules such that they are 
fragmented by thermal processes. 


Photochemical effects can be induced by a laser if the precursor molecules 
absorb at the laser wavelength. UV photons have sufficient energy to break 
the bonds in the precursor chemicals. Laser-assisted CVD (LACVD) uses a 
laser, usually an eximer laser, to provide the high energy photons needed to 
break the bonds in the precursor molecules. [link] shows two geometries for 
LACVD. For the perpendicular illumination the photochemical effects 
generally occur in the adsorbed adlayer on the substrate. Perpendicular 
irradiation is often done using a UV lamp instead of a laser so that 
unwanted substrate heating is not produced by the light source. The parallel 
illumination configuration has the benefit that reaction by-products are 
produced further from the growth surface and have less chance of being 
incorporated into the growing film. The main benefit of LACVD is that 
nearly no heat is required for deposition of high quality films. 
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Parallel (a) and perpendicular (b) irradiation 
in laser CVD. Adapted from J. G. Eden, in 
Thin Film Processes II, Eds. J. L. Vossen 
and W. Kern, Academic Press, New York 
(1991). 


An application of laser photolysis is photonucleation. Photonucleation is the 
process by which a chemisorbed adlayer of metal precursors is photolyzed 


by the laser to create a nucleation site for further growth. Photonucleation is 
useful in promoting growth on substrates that have small sticking 
coefficients for gas phase metal atoms. By beginning the nucleation process 
with photonucleation the natural barrier to surface nucleation on the 
substrate is overcome. 
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Liquid Phase Deposition 


Introduction 


Silicon dioxide (silica, SiOz) has been the most researched chemical 
compound apart from water. Silica has been used throughout history, for 
example, flint, which when sharpened formed one of humanities first tools. 
Crystalline silica, or sand, was melted into glass as early as 5000 B.C., 
birthing a technology that has gained sophistication in modern times. 
Silicon is the second most plentiful element in the Earth’s crust, the most 
plentiful being oxygen. It is thus surprising that it was not until 1800 that 
silica was named a compound by Sir Humphry Davy. He, however, failed to 
isolate its components via electrolysis, and it is Jons Jacob Berzelius who is 
thus credited with discovering silica in 1824. He heated potassium 
fluorosilicate with potassium metal and, after purifying the product of this 
reaction with water, produced amorphous silica powder. 


The most common forms of silica employed in industry include a-quartz, 
vitreous silica, silica gel, fumed silica and diatomaceous earth. Synthetic 
quartz is hydrothermally grown from a seed crystal, with aqueous NaOH 
and vitreous SiO», at 400 °C and 1.7 kbar. Because it is a piezoelectric 
material, it is used in crystal oscillators, transducers, pickups and filters for 
frequency control and modulation. Vitreous silica is super cooled liquid 
silica used in laboratory glassware, protective tubing sheaths and vapor 
grown films. Silica gel is formed from the reaction of aqueous sodium 
silicate with acid, after which it is washed and dehydrated. Silica gel is an 
exceptionally porous material with numerous applications including use as 
a dessicant, chromatographic support, catalyst substrate and insulator. 
Pyrogenic or fumed silica is produced by the high temperature hydrolysis, 
in an oxyhydrogen flame, of SiCly. Its applications include use as a 
thickening agent and reinforcing filler in polymers. Diatomaceous earth, the 
ecto-skeletons of tiny unicellular marine algae called diatoms, is mined 
from vast deposits in Europe and North America. Its primary use is in 
filtration. Additional applications include use as an abrasive, insulator, filler 
and a lightweight aggregate. 


Methods of colloidal growth and thin film deposition of amorphous silica 
have been investigated since 1925. The two most common and well- 
investigated methods of forming SiO, in a sol or as a film or coating are 
condensation of alkoxysilanes (known as the Stober method) and hydrolysis 
of metal alkoxides (the Iler or dense silica [DS] process). 


Liquid phase deposition (LPD) 


LPD is a method for the “non-electrochemical production of polycrystalline 
ceramic films at low temperatures.” LPD, along with other aqueous solution 
methods [chemical bath deposition (CBD), successive ion layer adsorption 
and reaction (SILAR) and electroless deposition (ED) with catalyst] has 
developed as a potential substitute for vapor-phase and chemical-precursor 
systems. Aqueous solution methods are not dependent on vacuum systems 
or glove boxes, and the use of easily acquired reagents reduces reliance on 
expensive or sensitive organometallic precursors. Thus, LPD holds potential 
for reduced production costs and environmental impact. Films may be 
deposited on substrates that might not be chemically or mechanically stable 
at higher temperatures. In addition, the use of liquid as a deposition medium 
allows coating of non-planar substrates, expanding the range of substrates 
that are capable of being coated. Aqueous deposition techniques have not 
reached the level of maturation that vapor-phase techniques have in respect 
to a high level of control over composition, microstructure and growth rates 
of the resulting films, but their prospect makes them attractive for research. 


LPD generally refers to the formation of oxide thin films, the most common 
being SiO>, from an aqueous solution of a metal-fluoro complex [MF,]™”, 
which is slowly hydrolyzed using water, boric acid or aluminum metal. 
Addition of water drives precipitation of the oxide. Boric acid and 
aluminum work as fluoride scavengers, rapidly weakening the fluoro 
complex and precipitating the oxide. These reactants are added either drop 
wise or outright, both methods allowing for high control of the hydrolysis 
reaction and of the solution’s supersaturation. Film formation is 
accomplished from highly acidic solutions, in contrast to the basic or 
weakly acidic solutions used in chemical bath deposition. 


A generic description of the LPD reaction is shown in [link], where m is the 
charge on the metal cation. If the concentration of water is increased or the 
concentration of hydrofluoric acid (HF) is decreased, the equilibrium will 
be shifted toward the oxide. Use of boric acid or aluminum metal will 
accomplish the latter, see [link] and [link]. The most popular of these 
methods for accomplishing oxide formation has been through the addition 
of boric acid. 

Equation: 


Equation: 


H,BO,+4HF=—= BF, +H,0*+2H,O 
Equation: 


Al+6HF === H,AIF, + 1.5 H, 


The first patent using liquid phase deposition (LPD) of silicon dioxide via 
fluorosilicic acid solutions (H»SiF,) was granted to the Radio Corporation 
of America (RCA) in 1950. RCA used LPD as a method for coating anti- 
reflective films on glass, but the patent promised further applications. Since 
this initial patent there have been many further patents and papers utilizing 
this method, in variable forms, to coat substrates, usually silicon, with 
silicon dioxide. The impetus behind this work is to create an alternative to 
the growth of insulator coatings by thermal oxidation or chemical vapor 
deposition (CVD) for planar silicon chip technology. Thermal oxidation and 
CVD are performed at elevated temperatures, requiring a higher output of 
energy and more complicated instrumentation than that of LPD. The most 
simple and elegant of the LPD methods uses only water to catalyze silica 
thin film growth on silicon from a solution of fluorosilicic acid 
supersaturated with silicon dioxide, [link]. 

Equation: 


H,SiF,+2H,O === SiO, +6HF 


The amount of water reacted with the supersaturated fluorosilicic acid 
solution controls both the growth rate and incorporation of fluorine into the 
resulting silica matrix. Both growth rate and fluorine content increase with 
increased addition of water. Ultimately this “dilution” affects the optical 
properties of the resulting silica film; an increased amount of fluorine 
decreases its dielectric constant (and thus its refractive index). 


To ensure a uniform film growth with LPD, the preparation of the surface to 
be coated is of utmost importance. Suitable treatments may involve the 
formation of surface hydroxides, the pre-deposition or self-assembly of an 
appropriate seed layer. The most efficient coverage is seen with silicon 
surfaces functionalized with hydroxy (-OH) groups prior to immersion in 
the growth solution. This can be achieved through appropriate etching of 
the silicon surface. It is proposed that the silanol (Si-OH) groups act to seed 
the growth of the silica film through condensation reactions with the silicic 
acid formed in the growth solution. 


Lee and co-workers and Homma separately propose that intermediate, 
hydrolyzed species, SiF,(OH)4_, (n < 4), are formed by the reaction shown 
in [link]. According to Lee, these species then react with the substrate 
surface to form a film. Homma proposes that fluorine-containing siloxanes 
are subsequently formed, which adsorb onto the surface where 
condensation and bonding occurs between the oligomers and surface 
hydroxyl groups. The former mechanism implies a molecular growth 
mechanism, whereas the latter implies homogeneous nucleation with 
subsequent deposition. 

Equation: 


H,SiF, + (4-n)H,0 === SiF,(OH),,, + (6-n) HF 
In concentrated fluorosilicic acid solutions silica can be dissolved to well 


beyond its solubility, forming fluorosilicon complexes such as [SiFg.SiF4]*, 
[link]. The bridged fluorosilicon complex has electron deficient silicon 


because of the high electronegativity of the bonded fluorines, creating weak 
Si-F bonds. These bonds are then prone to nucleophilic attack by water. The 
fluorine ion (F-) combines with the proton in this reaction to form 
hydrofluoric acid (HF). The product of this reaction can then react further 
with water to yield [SiF,(OH),]*, SiF, and HF. The high acidity of the 
solution then allows protons to react with [SiF,(OH),|*" to form 
tetrafluorosilicate (SiF,4) and water, [link]. Hydrolysis of the SiF, will then 
yield the hexafluorosilicate anion, protons and silicic acid, [link]. 
Equation: 


5 H,SiF, + SiO, > 3 [SiF,SiF,|> + 2H,O + 6 H* 
Equation: 

[SiF,(OH),|? + 2H* — SiF,+ 2H,O 
Equation: 


3 SiF,+4H,O — 2 SiF,* + Si(OH), + 4 H* 


Silicic acid is adsorbed onto the surface of the substrate that has been 
introduced into the growth solution. Molecular growth of silica on the 
substrate surface is initialized in an acid catalyzed dehydration between the 
silicic acid and the silanol groups on the substrate surface. Si-O-Si bonds 
are formed, resulting in an initial silica coating of the surface. Following 
reactions between the initial silica coating and the monosilicic acid in 
solution result in further silica deposition and growth. Because of the 
presence of HF in the solution, the surface and growing silica matrix is 
subject to attack according to the reaction in [link]. This explains the 
incorporation of a quantity of fluorine into the silica film. Additionally, it 
reveals that a certain amount of silica etching occurs along with growth. 
Because of the prevalence of the silicic acid in the solution, however, 
deposition is predominant. 

Equation: 


Si-OH + HF — Si-F+H,O 


This proposed mechanism, which is more in depth than those proposed by 
Lee and Homma, elucidates what is experimentally proven. The deposition 
rate of the silica increases with addition of H»O because the nucleophilic 
attack of the fluorosilicon complex is then augmented, increasing the 
concentration of silicic acid in the growth solution. The H2O addition 
increases the reaction rate and thus the concentration of HF in the growth 
solution, resulting in greater incorporation of fluorine into the silica matrix 
because of HF attack of the deposited film. Additionally, Yeh’s mechanism 
supports a molecular growth model, i.e., heterogeneous growth, which 
represents a consensus of the body of research performed thus far. 


In a solution with dissolved ceramic precursors, nucleation and growth will 
occur either in solution (homogenous nucleation) or on the surfaces of 
introduced solid phases (heterogeneous nucleation). Successful film 
formation relies on the promotion of heterogeneous nucleation. Solubility 
generally depends on the solution pH and the concentration of the species in 
solution. As the solution crosses over from a solvated state to a state of 
supersaturation, film formation can occur. It is vital to assure that the state 
of supersaturation is one that promotes film growth and not homogeneous 
nucleation and precipitation. This concept is illustrated in [link]. 
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Idealized solubility diagram for film forming 
species in water. Adapted from B. C. Bunker, P. C. 
Rieke, B. J. Tarasevich, A. A. Campbell, G. E. 
Fryxall, G. L. Graff, L. Song, J. Liu, J. W. Virden, 
and G. L. McVay, Science, 1994, 264, 48. 


Silica can be dissolved in fluorosilicic acid to well above its solubility in 
water, which is approximately 220 ppm (mg/L). Depending on the 
concentration of the fluorosilicic acid solution, it can contain up to 20% 
more silica than is implied by the formula H»SiF,. After saturation of the 
solution with SiO», the solvated species is a mixture of fluorosilicates, 
which reacts as explained earlier. It must be emphasized that addition of 
water in this reaction is not simply dilution, but is the addition of a reactant, 
which places the solution in a metastable state (the blue area in [link]) in 
preparation for the introduction of a suitable surface to seed the growth of 
silica. 


Another important factor in solution growth methods is interfacial energy. 
When a substrate with lower interfacial energy than that of a growing 
homogeneous nucleus is introduced into a growth solution, heterogeneous 
growth is favored. Thus, a seeded growth mechanism by definition 
introduces a substrate of lower interfacial energy into a supersaturated 
solution, facilitating heterogeneous growth. Lower interfacial energies can 
be a product of surface modification, as well as a property of the materials’ 
natural state. 


Comparing LPD to sol-gel 


An alternative method to LPD for forming silica thin films is the sol-gel 
method. A sol is a colloidal dispersion of particles in a liquid. A gel is a 
material that contains a continuous solid matrix enclosing a continuous 
liquid phase. The liquid inhibits the solid from collapsing and the solid 
impedes release of the liquid. A formal definition of sol-gel processing is 
the “growth of colloidal particles and their linking together to form a gel.” 
This method describes both the hydrolysis and condensation of silicon 
alkoxides and the hydrolysis and condensation of aqueous silicates (the DS 
process). 


In the hydrolysis of silicon alkoxides, an alkoxide group is replaced with a 
hydroxyl group, [link]. Further condensation reactions between alkoxyl 


groups or hydroxyl groups produce siloxane bonds, see [link] and [link]. 
Equation: 


=Si-OR+H,O =— =Si-OH + ROH 
Equation: 
=Si-OH + RO-Siz ==—* =Si-O-Si= + ROH 


Equation: 


=Si-OH+HO-Sie =— =Si-O-Si= +H,O 


Tetramethoxysilane [Si(QMe)4, TMOS] and tetraethylorthoxysilane 
[Si(OEt)4, TEOS] are the most commonly used precursors in silica sol-gel 
processing. The alkoxides are hydrolyzed in their parent alcohols, with a 
mineral acid or base catalyst, producing silicate gels that can be deposited 
as coatings. The Stober method, which utilizes this chemistry, relies on 
homogeneous nucleation to produce monodisperse sols. 


Iler’s DS method of silica film formation was originally patented as a 
pigment coating to increase dispersibility of titania particles for use in the 
paint industry. The DS method is based on the aqueous chemistry of silica 
and takes advantage of the species present in solution at varying pH. Below 
pH 7 three-dimensional gel networks are formed. Above pH 7 silica 
surfaces are quite negatively charged ([link]), so that particle growth occurs 
without aggregation. The isoelectric point of silica is pH 2. Reactions above 
and below pH 2 are thought to occur through bimolecular nucleophilic 
condensation mechanisms. Above pH 2 an anionic species attacks a neutral 
species ([link]) and below pH 2 condensation involves a protonated silanol 
({link]). The DS process has been utilized extensively in sol-gel coating 
technology and as a growth method for monodisperse and polydisperse sols. 
Equation: 


Si(OH), (aq) —> Si(OH),O° + H* 
Equation: 

SiO’ + =Si-OH —> =Si-O-Si= +OH- 
Equation: 


=SiOH,* + HO-Si= — =Si-O-Si= + H* 
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Selecting a Molecular Precursor for Chemical Vapor Deposition 


Introduction 


The proven utility of chemical vapor deposition (CVD) in a wide range of 
electronic materials systems (semiconductors, conductors, and insulators) 
has driven research efforts to investigate the potential for thin film growth 
of other materials, including: high temperature superconducting metal 
oxides, piezoelectric material, etc. Moreover, CVD potentially is well suited 
for the preparation of thin films on a wide range of substrates, including 
those of nonplanar geometries. CVD offers the advantages of mild process 
conditions (i.e., low temperatures), control over microstructure and 
composition, high deposition rates, and possible large scale processing. As 
with any CVD process, however, the critical factor in the deposition process 
has been the selection of precursors with suitable transport properties. 


Factors in selecting a CVD precursor molecule 


The following properties are among those that must be considered when 
selecting suitable candidates for a CVD precursor: 


1. The precursor should be either a liquid or a solid, with sufficient vapor 
pressure and mass transport at the desired temperature, preferably 
below 200 °C. Liquids are preferred over solids, due to the difficulty 
of maintaining a constant flux of source vapors over a non-equilibrium 
percolation (solid) process. Such non-bubbling processes are a 
function of surface area, a non-constant variable with respect both to 
time and particle size. The upper temperature limit is not dictated by 
chemical factors; rather, it is a limitation imposed by the stability of 
the mass flow controllers and pneumatic valves utilized in commercial 
deposition equipment. It must be stressed that while the achievement 
of an optimum vapor pressure for efficient utilization as an industrially 
practicable source providing high film growth rates (>10 Torr at 25 °C) 
is a worthy goal, the usable pressure regimes are those in which 
evaluation can be carried out on compounds which exhibit vapor 
pressures exceeding 1 Torr at 100 °C. 


2. The precursor must be chemically and thermally stable in the region 
bordered by the evaporation and transport temperatures, even after 
prolonged use. Early workers were plagued by irreproducible film 
growth results caused by premature decomposition of source 
compounds in the bubbler, in transfer lines, and, basically everywhere 
except on the substrate. Such experiences are to be avoided! 

3. By its very nature, CVD demands a decomposable precursor. This 
generally is accomplished thermally; however, the plasma-enhanced 
growth regime has seen much improvement. In addition, photolytic 
processes have tremendous potential. Nevertheless, the precursor must 
be thermally robust until deposition conditions are employed. 

4. The precursor should be relatively easy to synthesize, ensuring 
sufficient availability of material for testing and fabrication. It also is 
important that the synthesis of the compound be reproducible. It 
should be simple to prepare and purify to a relatively high level of 
purity. It should be non-toxic and environmentally friendly (i.e., as low 
a toxicity as can be attained, given the fundamental toxicity of 
particular elements such as mercury, thallium, barium, etc.). It should 
be routine to reproduce and scale-up the preparation for further 
developmental studies. It should utilize readily available starting 
reagents, and proceed by a minimum number of chemical 
transformations in order to minimize the cost. 

5. Due to handling considerations, the source should be oxidatively, 
hydrolytically, thermally and photochemically stable under normal 
storage conditions, in addition the precusor should resist 
oligomerization (in the solid, liquid, or gaseous states). It is worth 
noting that practitioners of metal organic CVD (MOCVD), especially 
for 13-15 materials have of necessity become expert in the handling of 
very toxic, highly air sensitive materials. 


Historically, researchers were limited in their choices of precursors to those 
that were readily known and commercially available. It must be emphasized 
that none of these previously known compounds had been designed 
specifically to serve as vapor phase transport molecules for the associated 
element. Thus, the scope was often limited to what was commercially 
available. However, as new compounds have now been made with the 


specific goal of providing ideal CVD precursors the choice to academia and 
industry has increased. 
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Determination of Sublimation Enthalpy and Vapor Pressure for Inorganic 
and Metal-Organic Compounds by Thermogravimetric Analysis 


Introduction 


Metal compounds and complexes are invaluable precursors for the chemical 
vapor deposition (CVD) of metal and non-metal thin films. In general, the 
precursor compounds are chosen on the basis of their relative volatility and 
their ability to decompose to the desired material under a suitable 
temperature regime. Unfortunately, many readily obtainable (commercially 
available) compounds are not of sufficient volatility to make them suitable 
for CVD applications. Thus, a prediction of the volatility of a metal-organic 
compounds as a function of its ligand identity and molecular structure 
would be desirable in order to determine the suitability of such compounds 
as CVD precursors. Equally important would be a method to determine the 
vapor pressure of a potential CVD precursor as well as its optimum 
temperature of sublimation. 


It has been observed that for organic compounds it was determined that a 
rough proportionality exists between a compound’s melting point and 
sublimation enthalpy; however, significant deviation is observed for 
inorganic compounds. 


Enthalpies of sublimation for metal-organic compounds have been 
previously determined through a variety of methods, most commonly from 
vapor pressure Measurements using complex experimental systems such as 
Knudsen effusion, temperature drop microcalorimetry and, more recently, 
differential scanning calorimetry (DSC). However, the measured values are 
highly dependent on the experimental procedure utilized. For example, the 
reported sublimation enthalpy of Al(acac)3 ([link]a, where M = Al, n = 3) 
varies from 47.3 to 126 kJ/mol. 


Structure of a typical metal B-diketonate 

complex. (a) acetylacetonate (acac); (b) 

trifluoro acetylacetonate (tfac), and (c) 
hexafluoroacetylacetonate (hfac). 


Thermogravimetric analysis offers a simple and reproducible method for 
the determination of the vapor pressure of a potential CVD precursor as 
well as its enthalpy of sublimation. 


Determination of sublimation enthalpy 


The enthalpy of sublimation is a quantitative measure of the volatility of a 
particular solid. This information is useful when considering the feasibility 
of a particular precursor for CVD applications. An ideal sublimation 
process involves no compound decomposition and only results in a solid- 
gas phase change, i.e., [link]. 

Equation: 


IM(L),, I sotia) a IM(L), | vapor) 
Since phase changes are thermodynamic processes following zero-order 


kinetics, the evaporation rate or rate of mass loss by sublimation (m,,p), at a 
constant temperature (T), is constant at a given temperature, [link]. 


Therefore, the m,,, values may be directly determined from the linear mass 
loss of the TGA data in isothermal regions. 
Equation: 


m,,» = _Almass] 
At 


The thermogravimetric and differential thermal analysis of the compound 
under study is performed to determine the temperature of sublimation and 
thermal events such as melting. [link] shows a typical TG/DTA plot for a 
gallium chalcogenide cubane compound ({link]). 
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A typical thermogravimetric/differential 
thermal analysis (TG/DTA) analysis of 
[(EtMe,C)GaSe],, whose structure is shown 
in [link]. Adapted from E. G. Gillan, S. G. 
Bott, and A. R. Barron, Chem. Mater., 1997, 
9, 3, 796. 


Structure of 
gallium 
chalcogenide 
cubane 
compound, 
where E = S, 
Se, and R = 
CMes, 
CMe>Ft, 
CEt Me, 
CEts. 


Data collection 


In a typical experiment 5 - 10 mg of sample is used with a heating rate of 
ca. 5 °C/min up to under either a 200-300 mL/min inert (N> or Ar) gas flow 
or a dynamic vacuum (ca. 0.2 Torr if using a typical vacuum pump). The 
argon flow rate was set to 90.0 mL/min and was carefully monitored to 
ensure a steady flow rate during runs and an identical flow rate from one set 
of data to the next. 


Once the temperature range is defined, the TGA is run with a 
preprogrammed temperature profile ({link]). It has been found that 
sufficient data can be obtained if each isothermal mass loss is monitored 
over a period (between 7 and 10 minutes is found to be sufficient) before 


moving to the next temperature plateau. In all cases it is important to 
confirm that the mass loss at a given temperature is linear. If it is not, this 
can be due to either (a) temperature stabilization had not occurred and so 
longer times should be spent at each isotherm, or (b) decomposition is 
occurring along with sublimation, and lower temperature ranges must be 
used. The slope of each mass drop is measured and used to calculate 
sublimation enthalpies as discussed below. 


Temperature (°C) 


Isotherm 


0 10 20 30 
Time (min.) 


A typical temperature profile for 
determination of isothermal mass loss rate. 


As an illustrative example, [link] displays the data for the mass loss of 
Cr(acac)3 ([link]a, where M = Cr, n = 3) at three isothermal regions under a 
constant argon flow. Each isothermal data set should exhibit a linear 
relation. As expected for an endothermal phase change, the linear slope, 
equal to Mgyp, increases with increasing temperature. 
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Plot of TGA results for Cr(acac)3 performed 
at different isothermal regions. Adapted 
from B. D. Fahlman and A. R. Barron, Adv. 
Mater. Optics Electron., 2000, 10, 223. 


Note:Samples of iron acetylacetonate ({link]a, where M = Fe, n = 3) may 
be used as a calibration standard through AH,,,, determinations before each 
day of use. If the measured value of the sublimation enthalpy for Fe(acac)3 
is found to differ from the literature value by more than 5%, the sample is 
re-analyzed and the flow rates are optimized until an appropriate value is 
obtained. Only after such a calibration is optimized should other 
complexes be analyzed. It is important to note that while small amounts (< 
10%) of involatile impurities will not interfere with the AH,,, analysis, 
competitively volatile impurities will produce higher apparent sublimation 
rates. 


It is important to discuss at this point the various factors that must be 
controlled in order to obtain meaningful (useful) m,,, data from TGA data. 


1. The sublimation rate is independent of the amount of material used but 
may exhibit some dependence on the flow rate of an inert carrier gas, 
since this will affect the equilibrium concentration of the cubane in the 
vapor phase. While little variation was observed we decided that for 
consistency Mg, values should be derived from vacuum experiments 
only. 

2. The surface area of the solid in a given experiment should remain 
approximately constant; otherwise the sublimation rate (i.e., 
mass/time) at different temperatures cannot be compared, since as the 
relative surface area of a given crystallite decreases during the 
experiment the apparent sublimation rate will also decrease. To 
minimize this problem, data was taken over a small temperature ranges 
(ca. 30 °C), and overall sublimation was kept low (ca. 25% mass loss 
representing a surface area change of less than 15%). In experiments 
where significant surface area changes occurred the values of M.yp 
deviated significantly from linearity on a log(m,,,) versus 1/T plot. 

3. The compound being analyzed must not decompose to any significant 
degree, because the mass changes due to decomposition will cause a 
reduction in the apparent m,,, value, producing erroneous results. With 
a simultaneous TG/DTA system it is possible to observe exothermic 
events if decomposition occurs, however the clearest indication is 
shown by the mass loss versus time curves which are no longer linear 
but exhibit exponential decays characteristic of first or second order 
decomposition processes. 


Data analysis 


The basis of analyzing isothermal TGA data involves using the Clausius- 
Clapeyron relation between vapor pressure (p) and temperature (T), [link], 
where AH,,p is the enthalpy of sublimation and R is the gas constant (8.314 
J/K.mol). 

Equation: 


Since msub data are obtained from TGA data, it is necessary to utilize the 
Langmuir equation, [link], that relates the vapor pressure of a solid with its 
sublimation rate. 

Equation: 


0.5 
_ 2a , 
= My sub 


After integrating [link] in log form, substituting in [link], and consolidating 
the constants, one obtains the useful equality, [link]. 


Equation: 
=——— 9.0522(AHsub) — 
leptin el) 0.0522(AHsub ) 4 | 0.0522(AHsub 1 log 1306 
T Tsub Mw 


Hence, the linear slope of a log(m,,, 1/7) versus 1/T plot yields AHs,». An 
example of a typical plot and the corresponding AH, value is shown in 
[link]. In addition, the y intercept of such a plot provides a value for T.,p, 
the calculated sublimation temperature at atmospheric pressure. 
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Plot of log(m,,,T 7) versus 1/T and the 
determination of the AH,,, (112.6 kJ/mol) 
for Fe(acac)3 (R* = 0.9989). Adapted from 

B. D. Fahlman and A. R. Barron, Adv. 

Mater. Optics Electron., 2000, 10, 223. 


[link] lists the typical results using the TGA method for a variety of metal 
B-diketonates, while [link] lists similar values obtained for gallium 
chalcogenide cubane compounds. 


Compound AH sub AScub ain Calculated 
(kJ/mol) (J/K.mol) calc. vapor 
(°C) pressure @ 
150 °C 
(Torr) 


Al(acac)3 
Al(tfac)3 

Al(hfac)3 
Cr(acac)3 
Cr(tfac)3 

Cr(hfac)3 
Fe(acac)3 
Fe(tfac)3 

Fe(hfac)s3 
Co(acac)3 
Co(tfac)3 


Co(hfac)3 


93 


74 


a2 


91 


71 


46 


112 


96 


60 


138 


119 


73 


220 


192 


152 


216 


186 


134 


259 


243 


169 


311 


295 


200 


150 


111 


70 


148 


109 


69 


161 


121 


81 


170 


131 


90 


3.261 


9.715 


29.120 


3.328 


9.910 


29.511 


2.781 


8.340 


25.021 


1.059 


3.319 


9.132 


Selected thermodynamic data for metal B-diketonate compounds 
determined from thermogravimetric analysis. Data from B. D. Fahlman and 


A. R. Barron, Adv. Mater. Optics Electron., 2000, 10, 223. 


Compound 


AHgup 
(kJ/mol) 


ASsub T sub 
calc. 
mol) (°C) 


(J/K. 


Calculated 
vapor 
pressure 


@ 150°C 


(Torr) 
[(Me3C)GaS ]4 110 300 94 2203 
[(EtMe,C)GaS ], 124 330 102 18.89 
[(EtyMeC)GaS ]4 137 339 131 1.173 
[(Et3C)GaS ]4 149 333 175 0.018 
[(Me3C)GaSe) |, 119 305 116 3.668 
[(EtMe,C)GaSe], 137 344 124 2.562 
[(EtyMeC)GaSe], 147 359 136 0.815 
[(EtzC)GaSe], 156 339 189 0.005 


Selected thermodynamic data for gallium chalcogenide cubane compounds 
determined from thermogravimetric analysis. Data from E. G. Gillan, S. G. 
Bott, and A. R. Barron, Chem. Mater., 1997, 9, 3, 796. 


A common method used to enhance precursor volatility and corresponding 
efficacy for CVD applications is to incorporate partially ([link]b) or fully 
({link]c) fluorinated ligands. As may be seen from [link] this substitution 
does results in significant decrease in the AH,,, and thus increased 
volatility. The observed enhancement in volatility may be rationalized either 
by an increased amount of intermolecular repulsion due to the additional 
lone pairs or that the reduced polarizability of fluorine (relative to 
hydrogen) causes fluorinated ligands to have less intermolecular attractive 
interactions. 


Determination of sublimation entropy 


The entropy of sublimation is readily calculated from the AH,,,, and the 
calculated T.,, data, [link]. 


Equation: 


ASwup 7 AH up 
T 


sub 


[link] and [link] show typical values for metal B-diketonate compounds and 
gallium chalcogenide cubane compounds, respectively. The range observed 
for gallium chalcogenide cubane compounds (AS, = 330 +20 J/K.mol) is 
slightly larger than values reported for the metal $-diketonates compounds 
(ASgyp = 130 - 330 J/K.mol) and organic compounds (100 - 200 J/K.mol), 
as would be expected for a transformation giving translational and internal 
degrees of freedom. For any particular chalcogenide, i.e., [((R)GaS],, the 
lowest AS,,,, are observed for the Me3C derivatives, and the largest AS, 
for the EtpMeC derivatives, see [link]. This is in line with the relative 
increase in the modes of freedom for the alkyl groups in the absence of 
crystal packing forces. 


Determination of vapor pressure 


While the sublimation temperature is an important parameter to determine 
the suitability of a potential precursor compounds for CVD, it is often 
preferable to express a compound's volatility in terms of its vapor pressure. 
However, while it is relatively straightforward to determine the vapor 
pressure of a liquid or gas, measurements of solids are difficult (e.g., use of 
the isoteniscopic method) and few laboratories are equipped to perform 
such experiments. Given that TGA apparatus are increasingly accessible, it 
would therefore be desirable to have a simple method for vapor pressure 
determination that can be accomplished on a TGA. 


Substitution of [link] into [link] allows for the calculation of the vapor 
pressure (p) as a function of temperature (T). For example, [link] shows the 
calculated temperature dependence of the vapor pressure for [((Me3C)GaS]j. 
The calculated vapor pressures at 150 °C for metal 6-diketonates 
compounds and gallium chalcogenide cubane compounds are given in [link] 
and [link]. 
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A plot of calculated vapor pressure (Torr) 
against temperature (K) for [((Me3C)GaS],. 
Adapted from E. G. Gillan, S. G. Bott, and 
A. R. Barron, Chem. Mater., 1997, 9, 3, 796. 


The TGA approach to show reasonable agreement with previous 
measurements. For example, while the value calculated for Fe(acac)3 (2.78 
Torr @ 113 °C) is slightly higher than that measured directly by the 
isoteniscopic method (0.53 Torr @ 113 °C); however, it should be noted 
that measurements using the sublimation bulb method obtained values 
much lower (8 x 10° Torr @ 113 °C). The TGA method offers a suitable 
alternative to conventional (direct) measurements of vapor pressure. 
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Phosphine and Arsine 
Because of their use in metal organic chemical vapor deposition (MOCVD) 


of 13-15 (III-V) semiconductor compounds phosphine (PH3) and arsine 
(AsHs3) are prepared on an industrial scale. 


Synthesis 

Phosphine (PH3) is prepared by the reaction of elemental phosphorus (P,) 
with water, [link]. Ultra pure phosphine that is used by the electronics 
industry is prepared by the thermal disproportionation of phosphorous acid, 
[link]. 

Equation: 


2P,+ 12H,O > 5PH; + 3H,PO, 
Equation: 
4H;PO; > PH; + 3H,PO, 
Arsine can be prepared by the reduction of the chloride, [link] or [link]. The 


corresponding syntheses can also be used for stibine and bismuthine. 
Equation: 


4 AsCl, + 3LiAIH, > 4 AsH, + 3 LiAICI, 
Equation: 


4 AsCl,+ 3NaBH, > 4 AsH,; + 3 NaCl + 3 BCI 


The hydrolysis of calcium phosphide or arsenide can also generate the 
trihydrides. 


Structure 


The phosphorus in phosphine adopts sp? hybridization, and thus phosphine 
has an umbrella structure ({link]a) due to the stereochemically active lone 
pair. The barrier to inversion of the umbrella (E, = 155 kJ/mol) is much 
higher than that in ammonia (E, = 24 kJ/mol). Putting this difference in 
context, ammonia’s inversion rate is 10!! while that of phosphine is 10°. As 
a consequence it is possible to isolate chiral organophosphines (PRR'R"). 
Arsine adopts the analogous structure ([link]b). 
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The structures of (a) phosphine 
and (b) arsine. 


Reactions 


Phosphine is only slightly soluble in water (31.2 mg/100 mL) but it is 
readily soluble in non-polar solvents. Phosphine acts as neither an acid nor 
a base in water; however, proton exchange proceeds via the phosphonium 
ion (PH,°) in acidic solutions and via PH>’ at high pH, with equilibrium 
constants K, = 4 x 10°°8 and K, = 41.6 x 10°2°, respectively. 


Arsine has similar solubility in water to that of phosphine (i.e., 70 mg/100 
mL), and AsH3 is generally considered non-basic, but it can be protonated 
by superacids to give isolable salts of ASH”. Arsine is readily oxidized in 
air, [link]. 
Equation: 


2 AsH,; + 30, > As,O, + 3H,O 


Arsine will react violently in presence of strong oxidizing agents, such as 
potassium permanganate, sodium hypochlorite or nitric acid. Arsine 
decomposes to its constituent elements upon heating to 250 - 300 °C. 


Gutzeit test 


The Gutzeit test is the characteristic test for arsenic and involves the 
reaction of arsine with Ag*. Arsine is generated by reduction of aqueous 
arsenic compounds, typically arsenites, with Zn in the presence of H»SOy,. 
The evolved gaseous AsH3 is then exposed to silver nitrate either as powder 
or as a Solution. With solid AgNO3, AsHs3 reacts to produce yellow 
Ag,AsNOs3, while with a solution of AgNO3 black Ag3As is formed. 


Hazards 


Pure phosphine is odorless, but technical grade phosphine has a highly 
unpleasant odor like garlic or rotting fish, due to the presence of substituted 
phosphine and diphosphine (P)H,). The presence of PH, also causes 
spontaneous combustion in air. Phosphine is highly toxic; symptoms 
include pain in the chest, a sensation of coldness, vertigo, shortness of 
breath, and at higher concentrations lung damage, convulsions and death. 
The recommended limit (RL) is 0.3 ppm. 


Arsine is a colorless odorless gas that is highly toxic by inhalation. Owing 
to oxidation by air it is possible to smell a slight, garlic-like scent when 
arsine is present at about 0.5 ppm. Arsine attacks hemoglobin in the red 
blood cells, causing them to be destroyed by the body. Further damage is 
caused to the kidney and liver. Exposure to arsine concentrations of 250 
ppm is rapidly fatal: concentrations of 25 — 30 ppm are fatal for 30 min 
exposure, and concentrations of 10 ppm can be fatal at longer exposure 
times. Symptoms of poisoning appear after exposure to concentrations of 
0.5 ppm and the recommended limit (RL) is as low as 0.05 ppm. 
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Mechanism of the Metal Organic Chemical Vapor Deposition of Gallium 
Arsenide 


Introduction 


Preparation of epitaxial thin films of III-V (13-15) compound 
semiconductors (notably GaAs) for applications in advanced electronic 
devices became a realistic technology through the development of metal 
organic chemical vapor deposition (MOCVD) processes and techniques. 
The processes mainly involves the thermal decomposition of metal alkyls 
and/or metal hydrides. 


In 1968 Manasevit at the Rockwell Corporation was the first to publish on 
MOCVD for the epitaxial growth of GaAs. This followed his pioneering 
work in 1963 with the epitaxial growth of silicon on sapphire. The first 
publication used triethylgallium [Ga(CH»CH3)3] and arsine (AsH3) in an 
open tube with hydrogen as the carrier gas. Manasevit actually coined the 
phrase MOCVD and since this seminal work there have been numerous 
attempts to improve and expand MOCVD for the fabrication of GaAs. 


Several processes, partly in series, partly in parallel take place during the 
growth by CVD. They are presented schematically in [link]. The relative 
importance of each of them depends on the chemical nature of the species 
involved and the design of the reactor used. The actual growth rate is 
determined by the slowest process in the series of events needed to come to 
deposition. 


[missing_resource: GaAs Fig 1.jpg] 


Schematic representation of 
the fundamental transport and 
reaction steps underlying 
MOCVD. Adapted from K. F. 
Jensen and W. Kern, in Thin 
Film Processes II, Eds. J. L. 
Vossen and W. Kern, 


Academic Press, New York 
(1991). 


Conventionally, the metal organic chemical vapor deposition (MOCVD) 
growth of GaAs involves the pyrolysis of a vapor phase mixture of arsine 
and, most commonly, trimethylgallium [Ga(CH3)3, TMG] and 
triethylgallium [Ga(CH»CH3)3, TEG]. Traditionally, growth is carried out in 
a cold-wall quartz reactor in flowing H> at atmospheric or low pressure. 
The substrate is heated to temperatures 400 - 800 °C, typically by RF 
heating of a graphite susceptor. Transport of the metal-organics to the 
growth zone is achieved by bubbling a carrier gas (e.g., H>) through the 
liquid sources that are in held temperature-controlled bubblers. 


Reaction mechanism 


While the overall reaction (where R = CH3 or CH»CH3) can be described 
by [link]. 
Equation: 


R,Ga + AsH,; — GaAs + 3 RH 


The nature of the reaction is much more complex. From early studies it was 
thought that free Ga atoms are formed by pyrolysis of TMG and As, 
molecules are formed by pyrolysis of AsH3 and these species recombine on 
the substrate surface in an irreversible reaction to form GaAs. 


Although a Lewis acid-base complex formed between TMG and AsHs3 is 
possible, it is now known that if there is any intermediate reaction between 
the TMG and AsHs, the product is unstable. However, early work indicated 
that free GaAs molecules resulted from the decomposition of a TMG.AsH3 
intermediate and that the heated surface contributed to the reaction. It was 
subsequently suggested that the reaction occurs by separate pyrolysis of the 
reactants and a combination of individual Ga and As atoms at the surface or 
just above it. Finally, evidence has also been found for TMG pyrolysis 


followed by diffusion through a boundary layer and for AsH3 pyrolysis 
catalyzed by the GaAs surface. 


There are several different kinds of potential reactions occuring in the CVD 
reaction chamber, namely, ligand dissociation, ligand association, , 
reductive elimination, oxidative addition, B-hydride elimination, etc. Some 
of them are listed in the following equations: 

Equation: 


Ga(CH;); —> Ga(CH;), + CH, (ligand dissociation) 


Equation: 


Ga(CH;), — Ga + CH,-CH, (reductive elimination) 


Equation: 


Ga(CH;), —> Ga(CH;) + CH,-CH, (reductive elimination) 


Equation: 


CH,+H — CH, (radical recombination) 


Equation: 


Ga-CH,-CH, —> Ga-H + H,C=CH, (6-hydride elimination) 


Equation: 


Ga(CH;), + AsR,; — (CH;),Ga-AsR, (ligand association) 


Using ALE studies as insight for MOCVD 


Given the stepwise and presumably simplified mechanism for atomic layer 
epitaxy (ALE) growth of GaAs, a number of mechanistic studies have been 
undertaken of ALE using TMG and AsH;3 to provide insight into the 
comparable MOCVD reactions. Nishizawa and Kurabayashi proposed that 
a CH3-terminated GaAs surface inhibits further heterogeneous 
decomposition of TMG and self-limits the growth rate to one 
monolayer/cycle. While, X-ray photoelectron spectroscopy (XPS) studies 
showed that no carbon was observed on a GaAs surface reacted with TMG. 
Furthermore, the same self-limiting growth was seen in in ALE using a 
metalorganic molecular beam epitaxy (MOMBE) with TMG and AsHsz. It 
was reported that a transient surface reconstruction is observable by 
reflection high-energy electron diffraction (RHEED) during the ALE of 
GaAs in MOMBE. It was suggested that this structure is caused by CH3- 
termination and the self-limitation of the growth rate is attributed to this 
structure. However, measurement of the desorption of CH3 by means of a 
combination of pulsed molecular beams and time-resolved mass 
spectrometry, indicates that CH3 desorption is too fast to attribute the self- 
limitation to the CH3-terminated surface. Subsequently, investigations of 
the pyrolysis of TMG on a (100)GaAs surface by the surface photo- 
absorption method (SPA) allowed for the direct observation of CH3 
desorption from a GaAs surface reacted with TMG. From the measured 
CH3 desorption kinetics, it was shown that the CH3-terminated surfaces 
causes the self-limitation of the growth rate in ALE because the excess 
TMG cannot adsorb. 


All this research helped people to visualize the real reaction mechanism in 
the formation of GaAs by MOCVD methods, in which the decomposition, 
diffusion and surface reaction interact with each other and result in a much 
more complicated reaction mechanism. 


Gas phase reaction: pyrolysis of TMG and AsH3 


In the TMG/Hp system, there is almost no reactions at a temperature below 
450 °C, whereas the reaction of TMG with H> almost completely changed 
into CH, and Ga at a temperature above 600 °C, [link]. 

Equation: 


Ga(CH;); + 7/3H, > Ga + 3CH, 


As for the AsH3 decomposition, without any deposition of Ga or GaAs in 
the reactor, the pyrolysis of AsH3 proceeded barely at a temperature below 
600 °C, however, it proceeded nearly completely at a temperature above 
750 °C. In the AsH3/H> system with the TMG introduced previously, the 
decomposition of AsH3 was largely enhanced even at a temperature below 
600 °C. The decomposition of AsH3 seems to be affected sensitively by the 
deposited GaAs or Ga. This phenomenon may be concluded to be caused by 
the catalytic action by GaAs or Ga. The reaction at a temperature below 600 
°C can be described as shown in [link], but at a temperature above 600 °C, 
pyrolysis of AsH3 can occur even without GaAs or Ga, [link]. 

Equation: 


GaAs 
ASH3 (ag) = AS (ag) + eae 


Equation: 


AsH, — As + 37/,H, 


Adsorption and surface reactions 


From the temperature dependent measurements of the desorption spectrum 
from a surface on which TMG was supplied, it was estimated that the 
surface-adsorbed species was Ga at the high temperature region of T,,, > 
500 °C, GaCHs3 at the range of 350 °C < T,,, < 500 °C, and Ga(CH3), and 
Ga(CH)3)3 at the range of T.,, < 350 °C. The reactions, where (ad) means 
the adsorbed state of the molecules, are: 

Equation: 


< 350 °C) 


sub 


Equation: 


Ga(CHs)3(.) > ~Ga(CHg)5'(aa) (Ty, < 350 °C) 
Equation: 

Ga(CHs)3(.) > GaCHy,,3, + 2 CH, (350°C <T,,, < 500 °C) 
Equation: 

Ga(CH)3(.) > Gaia + 3 CH, (Tu, > 500 °C) 


When AsHs3 is supplied, the reactions with these adsorbates are: 
Equation: 


Ga(CH;), + AsH, — no reaction (Tsu, < 350 °C) 
Equation: 
Ga(CH3)3 (a4) + ASH; —> no reaction (Tun < 350 °C ) 


Equation: 

GaCHy (aq) + ASH; -> GaAs + CH, + H, (350<T,,, < 500°C) 
Equation: 

Gag) + ASH; — GaAs + 7/,H, (T,,,, > 500 °C) 


It was observed that there is no growth in the range of T,,,, < 350 °C, i.e., 
Ga(CH3)9‘(aq) and Ga(CH3)3/aq) do not react with AsH3 in the TMG-AsH3 
system. Monomolecular layer growth is limited by the formation of GaCH3 
and its reaction with AsH3. 


Overall reaction pathway 


At lower temperature (350 - 500 °C), equivalently low energy, TMG 
decompose in the gas phase to Ga(CH3)5 and methyl radical, [Link]. 
Equation: 


Ga(CH;), —> Ga(CH;), + CH, (low energy, gas phase) 


After the first ligand dissociation, there are two different pathways, in the 
first, the Ga(CH3) keeps decomposing into GaCH3 and another methyl 
group when it is at the gas-substrate interface, [link], and then further 
decomposes into free gallium atoms on the substrate surface, [link]. In the 
second reaction, the Ga(CH3). decomposes directly into Ga and CH3-CH3 
by reductive elimination, [link]. 


Equation: 

Ga(CH;), —> GaCH, + 2 CH, (gas/surface) 
Equation: 

GaCH, — Ga + CH, (surface) 
Equation: 

Ga(CH;), — Ga + H,C-CH, (reductive elimination) 


At high temperature (> 500 °C), the TMG decomposes into Ga(CH3) and 
two methyl groups instead of the step-wise decomposition at lower 
temperature, [link], and the Ga(CH3) further decomposes into free Ga atoms 
at the substrate surface, [link]. 

Equation: 


Ga(CH;), — GaCH, + 2 CH, (high temperature, gas phase) 


Equation: 


GaCH, — Ga + CH, 


The decomposition of AsH3 forms an “arsenic cloud” in the reaction 
chamber. The decomposition is also step-wise: 
Equation: 


AsH,; — AsH, 


Equation: 


AsH, — HAs + H (surface) 


Equation: 


HAs — As + H (surface) 


The methyl groups in the surface Ga(CH3) molecules are removed by the 
formation of methane with atomic hydrogen from the decomposition of 
AsHs3, [Link]. 

Equation: 


H + CH, — CH, (surface) 


Kinetics for other systems 


Investigations have been reported for the mechanism of the growth of GaAs 
using triethylgallium [Ga(CH»CH3)3, TEG] and TMG with trimethylarsene 
[As(CH3)3, TMA], triethylarsene [As(CH»CH3)3, TEA], tert-butylarsine 

{[(CH3)3C]AsH», TBA}, and phenylarsine [(CgH,)AsH>]. The experiments 


were conducted in a MOCVD reactor equipped with a recording 
microbalance for in-situ growth rate measurements. For example, the 
kinetics of the growth of GaAs were investigated by measuring growth rate 
as a function of temperature using the microbalance reactor while holding 
the partial pressure of gallium precursor (e.g., TMG) and arsenic precursor 
[e.g., As(CH3)3] constant at 0.01 and 0.05 Torr, respectively. Three different 
flow rates were used to determine the influence of the gas residence time. 


The growth rate of GaAs with TMG and As(CH»CH3)3 is higher as 
compared with the growth from TMG and As(CHs3)3 because of the lower 
thermal stability of As(CH»CH3)3 than As(CH3)3. Both of the two growth 
rates showed a strong dependence on the residence time. 


Similarly, the kinetic behaviors of the TMG/TBA and TEG/TBA system 
were investigated under the same conditions as the TMA and TEA studies. 
There are two distinct regions of growth. For TMG/TBA, the deposition 
rate is independent at low temperature and in the intermediate temperature 
(around 600 °C) the dependence of the growth rate on the total flow rate is 
significant. This means that the growth at the lower temperature is 
controlled by surface reactions. The TEG/TBA system showed a similar 
behavior except that the maximum growth rate occurs around 450 °C while 
it is around 750 °C for TMG/TBA system. Also, the growth of 
TMG/(CgH;)AsH> was studied on the same conditions as for the 
Me3Ga/‘BuAsH, system. It was reported that the difference in the growth 
rate at various flow rates was related to a combination of parasitic reactions 
and depletion effects from deposition. From the comparison of the data, it is 
deduced that the effect of parasitic reactions is slightly smaller for 
(CgHs)AsH> than for TBA. 


Two possible mechanisms for the dependence of growth rate on flow rate 
were proposed. The first, mass-transfer limitation was thought to be 
unlikely because of the high diffusivity of the gallium precursors at 1 Torr 
(ca. 350 cm?/s). The second, also the more likely explanation for the 
observed growth-rate dependence on flow rates is gas-phase depletion cause 
by the parasitic reactions. Since the growth efficiency is high (41% at 700 
°C), the loss of precursor from the gas phase will directly affect the growth 
rate. It was evidenced by the differences in the growth rates between split 


and combined feed streams. The growth rate is lower when the reagents are 
combined upstream of the reactor than when they are combined inside the 
reactor (split stream). It is suggested that the experimental observations can 
be explained by a model based on the reversible formation of an adduct and 
the decomposition of this adduct to useless polymeric material competing 
with the growth of GaAs. It can be written in the form shown in [link] 
where ky and k, are the forward and reverse rate constants for adduct 
formation, respectively, kg is the rate constant for the irreversible 
decomposition of the adduct to polymer, and k, is the surface reaction rate 
constant for the growth of GaAs. It is obvious that each step involves 
several elementary reactions, but there were insufficient data to provide any 
more detail. 

Equation: 


ky d 
GaAs <— organometallic precursors === adduct — polymeric deposits 
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Chemical Vapor Deposition of Silica Thin Films 


General considerations 


Before describing individual chemical vapor deposition (CVD) systems for the deposition of 
silica thin films, it is worth outlining general considerations to be taken into account with regard 
to the growth by CVD of any insulating film: the type of CVD method, deposition variables, 
and limitations of the precursor. 


Deposition methods 


In regard to the CVD of insulating films in general, and silica films in particular, three general 
reactors are presently used: atmospheric pressure CVD (APCVD), low and medium temperature 
low pressure CVD (LPCVD), and plasma-enhanced CVD (PECVD). LPCVD is often further 
divided into low and high temperatures. 


APCVD systems allow for high throughput and even continuous operation, while LPCVD 
provides for superior conformal step coverage and better film homogeneity. PECVD has been 
traditionally used where low temperatures are required, however, film quality is often poor. As 
compared to PECVD, photo-assisted CVD has the additional advantage of highly selective 
deposition, although it has been little used in commercial systems. [link] summarizes the 
advantages and disadvantages of each type of CVD system commercially used for SiO, films. 


Atmospheric Low Medium Plasma 
pressure temperature temperature enhanced 
CVD LPCVD LPCVD CVD 
a 300 - 500 300 - 500 500 - 900 100 - 350 
Throughput high high high low 
aap poor poor conformal poor 
coverage 
igs : good good excellent poor 
properties 
iisés passivation, passivation, ‘cenlaiian passivation, 


insulation insulation insulation 


Comparison of different deposition methods for SiO» thin films. 


Deposition variables 


The requirements of CVD films for electronic device applications have become increasingly 
more stringent as device sizes are continually reduced. Film thickness must be uniform across 
an entire wafer, i.e., better than +1%. The structure of the film and its composition must be 
controlled and reproducible, both on a single wafer, as well as between wafer samples. It is also 
desirable that the process is safe, inexpensive, and easily automated. 


A number of variables determine the quality and rate of film growth for any material. In general, 
the deposition rate increases with increased temperature and follows the Arrhenius equation, 
[link], where R is the deposition rate, E, is the activation energy, T is the temperature (K), A is 
the frequency factor, and k is Boltzmann's constant (1.381 x 10°23 J/K). 

Equation: 


R = A exp(-E,/kT) 


At the high temperatures the rate of deposition becomes mass transport limited. Meaning, the 
rate of surface reaction is faster than the rate at which precursors are transported to the surface. 
In multiple source systems, the film growth rate is dependent on the vapor phase concentration 
(or partial pressure) of each of the reactants, but in certain cases the ratio of reactants is also 
important, e.g., the SiH,/O» growth of SiO». Surface catalyzed reactions can also alter the 
deposition rate. Such as the non-linear dependence of the deposition rate of SiO on the partial 
pressure of Si(OEt),4. Gas depletion may also be significant requiring either a thermal ramp in 
the chamber and/or special reactor designs. The necessary incorporation of dopants usually 
lowers deposition rates, due to competitive surface binding. 


For the applications of insulating materials as isolation layers, an important consideration is step 
coverage: whether a coating is uniform with respect to the surface. [link]a shows a schematic of 
a completely uniform or conformal step coverage of a trench (such as occurs between isolated 
devices) where the film thickness along the walls is the same as the film thickness at the bottom 
of the step. Uniform step coverage results when reactants or reactive intermediates are able to 
migrate rapidly along the surface before reacting. When the reactants adsorb and react without 
significant surface migration, deposition is dependent on the mean free path of the gas. [link]b 
shows an example of minimal surface migration and a short mean free path. For SiO» film 
growth LPCVD has highly uniform coverage ([link]a) and PECVD poor step coverage ([link]b). 


film ———~> 


substrate ——> 


(a) (b) 


Step coverage of deposited films with (a) uniform 
coverage resulting from rapid surface migration and (b) 
nonconformal step coverage due to no surface migration. 


Precursor considerations 


The general requirements for any CVD precursor have been adequately reviewed elsewhere, and 
will not be covered here. However, many of the gases and organometallics used to deposit 
dielectric films are hazardous. The safety problems are more severe for LPCVD because the 
process often uses no diluent gas such as argon or nitrogen. [link] lists the boiling point and 
hazards of common inorganic and organometallic precursor sources for CVD of SiO» and doped 
silica. Many of the precursors react with air to form solid products, thus leaks can cause 
particles to form in the chamber and gas lines. 


Gas Formula Bpt (°C) Hazard 
ammonia NH3 -33.35 toxic, corrosive 
argon Ar -185.7 inert 

arsine AsH3 -55 toxic 

diborane BoHg -92.5 toxic, flammable 
dichlorosilane SiCl)H> 8.3 toxic, flammable 
hydrogen Hy -252.8 flammable 


nitrogen N> -209.86 inert 


nitrous oxide N»O -88.5 oxidizer 

oxygen Oz -182.962 oxidizer 

phosphine PH3 -87.7 toxic, P2H, impurities, flammable 
silane SiH, -111.8 flammable, toxic 


Physical and hazard properties of common gaseous sources for CVD of dielectric materials. 


In principle, the deposition of a SiO», or silica, thin film by CVD requires two chemical sources: 
the element (or elements) in question, and an oxygen source. While dioxygen (O3) is suitable for 
many applications, its reactions may be too fast or too slow for optimum film growth, requiring 
that alternative oxygen sources be used, e.g., nitrous oxide (N»O) and ozone (O3). A common 
non-oxidizing oxygen source is water. A more advantageous approach is to incorporate oxygen 
into the ligand environment of the precursor, and endeavor to preserve such an interaction intact 
from the source molecule into the ultimate film; such a source is often termed a "single-source" 
precursor. 


CVD silica (SiO,) 


The processing sequence for silicon dioxide (SiO») used depends on its specific use. CVD 
processes for SiO» films can be characterized by either the chemical reaction type, the growth 
pressure, or the deposition temperature. The choice of route is often dictated by requirements of 
the thermal stability of the substrate or the conformality. [link] summarizes selected properties 
of SiO» grown by various CVD methods, in comparison to that of thermally grown silica. In 
general, silica grown at high temperatures resemble thermally grown “native” SiO». However, 
the use of aluminum metallization requires low temperature deposition of silica. 


SiCI,H> 


Deposition Plasma O, Si(OEt), +N,0 Thermal 
ae 200 450 700 900 1000 
Composition SiO, 9(H) SiO»(H) SiO» SiO2(Cl) SiO» 

= ee es ae conformal conformal conformal 
Thermal loses H densifies stable loses Cl stable 


stability 


Refractive 


1.47 1.44 1.46 1.46 1.46 
idex 

Dielectric 49 43 4.0 4.0 Be, 
constant 


Comparison of physical properties of SiO) grown by commercial CVD methods. 


CVD from hydrides 


The most widely used method for SiO, thin film CVD is the oxidation of silane (SiH), first 
developed in 1967 for APCVD. Nonetheless, LPCVD systems have since become increasingly 
employed, and exceptionally high growth rates (30,000 A/min) have been obtained by the use of 
rapid thermal CVD. 


The chemical reaction for SiO» deposition from SiH, is: 
Equation: 


SiH, + O, > SiO, + 2H, 


At high oxygen partial pressures an alternative reaction occurs, resulting in the formation of 
water. 
Equation: 


SiH, + 20, > SiO, + 2H,O 


While these reactions appears simple, the detailed mechanism involves a complex branching- 
chain sequence of reactions. The apparent activation energy is low (< 41 kJ/mol) as a 
consequence of its heterogeneous nature, and involves both surface adsorption and surface 
catalysis. 


Nitrous oxide (N2O) can be used as an alternative oxygen source to O2, according to the overall 
reaction, [link]. 
Equation: 


SiH, + 2N,O > SiO, + 2H, + 2N, 


A simple kinetic scheme has been developed to explain many of the observed aspects of SiHy- 
NO growth. It was suggested that the reaction is initiated by decomposition of NO, [link], 
generating an oxygen radical which can abstract hydrogen from silane forming a hydroxyl 
radical, [link], that can react further with silane, [link]. 


Equation: 


N,0 > N, +0 


Equation: 


SiH, + O > SiH, + OH 


Equation: 


SiH, + OH > SiH, + H,O 


Evidence for the reaction of the OH radical to form water is the formation of a small quantity of 
water observed during the oxidation of SiH4. Silyl radicals are oxidized by N2O to form siloxy 
radicals, [link], which provide a suitable propagation step, [link]. 

Equation: 


SiH, + N,O > SiH,O +N, 


Equation: 


SiH,O + SiH, > SiH,OH + SiH, 


It has been proposed that the silanol (SiH3OH) is the penultimate film precursor. 


The SiHy-O> and SiH,-N>O routes to SiO, thin films are perhaps the most widely studied 
photochemical CVD system of all dielectrics. Photo-C VD of SiO, provides a suitable route to 
deposition at low substrate temperatures, thereby avoiding potential thermal effects of wafer 
warpage and deleterious dopant redistribution. In addition, unlike other low temperature 
methods such as APCVD and PECVD, photo-CVD often provides good purity of films. 


A summary of common silane CVD systems is given in [link]. 


Oxygen Carrier gas CVD Deposition Growth rate 
source (diluent) method temp. (°C) (A/min) 
O> N> APCVD 350 - 475 100 - 14,000 


Oz Ar LPCVD 100 - 550 100 - 30,000 


O» At/N> LPCVD 25 - 500 10 - 450 


O, Ar PECVD 25 - 200 200 - 900 
N,O N> APCVD 490 - 690 200 - 1,200 
N,O N> LPCVD 700 - 860 ca. 50 

N,O N> LPCVD 25 - 350 7 - 180 
N,O Ar PECVD 100 - 200 80 - 800 


Precursors and deposition conditions for SiO, CVD using silane (SiH). 


CVD from halides 


The most widely used process of the high temperature growth of SiO) by LPCVD involves the 
N>O oxidation of dichlorosilane, SiC] )Hb, [link]. 
Equation: 


SiCL,H, + 2N,O > SiO, + 2HCI + 2N, 


Deposition at 900 - 915 °C allows for growth of SiO, films at ca. 120 A/min; however, these 
films are contaminated with Cl. Addition of small amounts of O» is necessary to remove the 
chlorine. 


While PECVD has been employed utility halide precursors, the ability of small quantities of 
fluorine to improve the electrical properties of SiO) has prompted investigation of the use of 
SiF,4 as a suitable source. 


CVD from tetraethoxylsilane (TEOS) 


The first CVD process to be introduced into semiconductor technology in 1961 was that 
involving the pyrolysis of tetraethoxysilane, Si(OEt)4 (commonly called TEOS from 
tetraethylorthosilicate). Deposition occurs at an optimum temperature around 750 °C. However, 
under LPCVD conditions, the growth temperature can be significantly lowered (> 600 °C). The 
high temperature growth of SiO» from TEOS involves no external oxygen source. Dissociative 
adsorption studies indicate that decomposition of the TEOS-derived surface bound di- and tri- 
ethoxysiloxanes is the direct source of the ethylene. 


PECVD significantly lowers deposition temperatures using TEOS, but requires the addition of 
O, to remove carbon contamination, via the formation of gaseous CO and CO», which are 


subsequently not incorporated within the film. Although deposition as low as 100 °C may be 
obtained, the film resistivity increases by three orders of magnitude by depositing at 200 °C; 
being 10!° Q.cm, with a breakdown strength of 7 x 10° V/cm. 


Addition of O) for APCVD growth does not decrease the deposition temperature, however, if 
ozone (O3) is used as the oxidation source, deposition temperatures as low as 300 °C may be 
obtained for uniform crack-free films. It has been postulated that the ozone traps the TEOS 
molecule on the surface as it reacts with the ethoxy substituent, providing a lower energy 
pathway (TEOS-O3 @ 55 kJ/mol versus TEOS-O2 @ 230 kJ/mol and TEOS only @ 190 
kJ/mol). 


There are significant advantages of the TEOS/O3 system, for example the superior step 
coverage it provides. Furthermore, films have low stress and low particle contamination. On this 
basis the TEOS/O3 system has become widely used for silica, as well as silicate glasses. 


CVD from other organosilicon precursors 


A wide range of alternative silicon sources has been investigated, especially with regard to 
either lower temperature deposition and/or precursors with greater ambient stability. 


Diethylsilane (Et)SiH>), 1,4-dislabutane (DBS, H3SiCH CH >SiHs3), 2,4,6,8- 
tetramethylcyclotetrasiloxane (TMCTS, [link]a, where R = CH3), and 2,4,6,8- 
tetraethylcyclotetrasiloxane (TECTS, [link]a, where R = CjHs), have been used in conjunction 
with O, over deposition temperatures of 100 - 600 °C, depending on the precursor. Diacetoxydi- 
tert-butyl silane (DADBS, [link]b) has been used without additional oxidation sources. High 
quality silicon oxide has been grown at 300 °C by APCVD using the amido precursor, 
Si(NMe»)4 ([link]c). 


H 
R, | 
“Si H i 
a = ZN 
(0) SimR HC Oe (CH3).N 
| : gy:z OC(CHs)s \ gyn! N(CH3)2 
Re Si fe) r "* OC(CH3)3 7) N(CH) 
rr ia ne H3C. Pasi (CH3)2N 
% II 
R O 
(a) (b) (c) 
H H 
‘si—_ o— Si 
HL SK ! , | 
sors | 
Si-|-O— Si 
? St i oS ik 
ly 7 
Si——0 “Si 
a” \ 
H 


Alternative organometallic silicon sources that have been 
investigated for the growth of silica thin films. 


An interesting concept has been to preform the -Si-O-Si- framework in the precursor. In this 
regard, the novel precursor Tg-hydridospherosiloxane (HgSigO1p, [link]d) gives smooth 
amorphous stoichiometric SiO» at 450 - 525 °C by LPCVD. The decomposition mechanism in 
the presence of added oxygen involves the loss of water, [link]. IR studies indicate that the Si-O- 
Si bonds are preserved during deposition. While films are of high quality, the present synthesis 
of HgSigO4> is of low yield (ca. 21%), making it currently impractical for large scale processing. 
Equation: 


H,Si0,. + 40, > 8SiO, + 4H,O 


CVD silicate glasses 


Borosilicate glasses (BSG), phosphosilicate glasses (PSG) and borophosphosilicate glasses 
(BPSG) are frequently used as insulating layers separating conducting layers. These glasses 
have lower intrinsic stress, lower melting temperatures and better dielectric properties than SiOz 
itself. PSG and BPSG have the added property of gettering and immobilizing dopants. 
Particularly important is the gettering of sodium ions, which are a source of interface traps. The 
low temperature molten properties of BSG, PSG, and BPSG glasses allow for the smoothing of 
the device topography by viscous thermal fusion to convert abrupt steps to more gradually 
tapered steps ([link]a) as well as planarization of complex topologies ([link]b), enabling 
deposition of continuous metal layers. This process is commonly called P-glass flow. The boron 


and phosphorous contents of the silicate glasses vary, depending on the application, typically 
being from 2 to 8 weight per cent. 


BPSG 


metallization 


Si-substrate 


(a) (b) 


Schematic cross section of BPSG as deposited (a) and after 
annealing (b), showing the flow causing a decrease in the angle 
of the BPSG going over the step. 


The advantage of BPSG over PSG is that flow occurs over the temperature range of 750 - 950 
°C, depending on the relative P and B content (as opposed to 950 - 1110 °C for PSG). Lowering 
of the flow temperature is required to minimize dopant migration in VLSI devices. Conversely, 
the disadvantages of BPSG versus PSG include the formation of bubbles of volatile 
phosphorous oxides and crystallites of boron-rich phases. If, however, the dopant concentration 
is controlled, these effects can be minimized. 


Arsenosilicates (AsSG) were employed originally in silicon device technology as an arsenic 
dopant source for planar substrates prior to the advent of large scale ion implantation which has 
largely removed the need for AsSG in doping applications. However, with ULSI silicon circuit 
fabrication, the requirement for doping of deep trenches (inaccessible to ion implantation) has 
witnessed the re-emergence of interest in AsSG films. 


The CVD growth of silicate glasses follows that of SiO, with SiH, and TEOS being the most 


commonly employed silicon precursors. A summary of common CVD precursor systems for 
silicate glasses is given in [link]. 


CVD Deposition temp. 


Precursors “aethod (°C) Applications 
SiH,/B>Hg APCVD 300 - 450 good step 
coverage 


SiH,/B>H, LPCVD 350 - 400 : 


SiH,/PH; APCVD 300 - 450 : 


SiH,/PH3 LPCVD 350 - 400 flow glass 
SiH4/B>H¢/PH3 APCVD 300 - 450 - 
SiH4/B>H¢/PH3 LPCVD 350 - 400 - 
SiH4/AsH3 APCVD 500 - 700 - 
TEOS/B(OMe)3 APCVD 650 - 730 ee 
source 
TEOS/B(OMe)3 LPCVD 500 - 750 trench filling 
TEOS/B(OEt)3 APCVD 475 - 800 oon 
source 
TEOS/B(OEt)3 LPCVD 500 - 750 eon 
source 
TEOS/PH3 LPCVD 650 flow glass 
TEOS/O=P(OMe)3 APCVD 300 - 800 flow glass 
diffusion 
TEOS/P(OMe)3 LPCVD 500 - 750 
source 
TEOS/O=P(OMe)3 LPCVD 500 - 800 flow glass 
TEOS/B(OMe):3/PH3 LPCVD 620 - 800 trench filling 
TEOS/B(OMe)3/P(OMe)3 LPCVD 675 - 750 flow glass 
TEOS/B(OMe)3/O=P(OMe)3 LPCVD 680 flow glass 
diffusion 
TEOS/AsCls APCVD 500 - 700 
source 
TEOS/As(OEt)3 LPCVD 700 - 730 trench doping 
TEOS/O=As(OEt)3 LPCVD 700 - 730 trench doping 


Precursors and deposition conditions for CVD of borosilicate glass (BSG), phososilicate glass 
(PSG), borophosphosilicate glass (BPSG) and arsenosilicates (AsSG) thin films. 


CVD from hydrides 


Films of BSG, PSG, and BPSG may all be grown from SiHy, O» and B>Hg¢ and/or PHs, at 300 - 
650 °C. For APCVD, the reactants are diluted with an inert gas such as nitrogen, and the 
O,/hydride molar ratio is carefully controlled to maximize growth rate and dopant concentration 
(values of 1 to 100 are used depending on the application). Ordinarily, the dopant concentration 
for both BSG and PSG decreases with increased temperature. However, some reports indicate 
an increase in boron content with increased temperature. Film growth of BPSG was found to 
occur in two temperature regions. Deposition at low temperature (270 - 360 °C) occurred via a 
surface reaction rate limiting growth (E, = 39 kcal/mol), while at higher temperature (350 - 450 
°C), a mass-transport rate limited reaction region is observed (E, = 7.6 kcal/mol). 


LPCVD of BSG and PSG is conducted at 450 - 550 °C with an O»:hydride ratio of 1:1.5. 
Conversely, an Oy:hydride ratio of 1.5:1 provides the optimum growth conditions for BPSG 
over the same temperature range. The phosphorous in PSG films was found to exist as a mixture 
of P»Os and P»03, however, the latter can be minimized under the correct deposition conditions. 
Some difficulties have been reported for the use of B>Hg due to its thermal instability. 
Substitution of ByHg with BCl3 obviates this problem, although the resulting films are 
invariably contaminated with 1 weight per cent chloride. 


Arsenosilicate glass (AsSG) thin films are generally grown by APCVD using arsine (AsH3); the 
use of which is being limited due to its high toxicity. However, arsine inhibits the gas phase 
reactions between SiH, and Oy, such that film grown from SiH4/AsH3/O> show improved step 
coverage at high deposition rates. 


CVD from metal organic precursors 


As with SiO, deposition, see above, there has been a trend towards the replacement of SiH, with 
TEOS on account of its ability to produce highly conformal coatings. This is particularly 
attractive with respect to trench filling. Furthermore, films of doped SiO» glasses have been 
obtained using both APCVD and LPCVD (typically below 3 Torr), with a wide variety of 
dopant elements including: boron, phosphorous, and arsenic, including antimony, tin, and zinc. 


Boron-containing glasses are generally grown using either trimethylborate, B(OMe)s, or 
triethylborate, B(OEt)3, although the multi-element source, tris(trimethylsilyl)borate, 
B(OSiMe3)3, has been employed for both silicon and boron in BPSG thin film growth. 
Similarly, whereas PH3 may be used as the phosphorous source, trimethylphosphite, P?COMe)s, 
and trimethylphosphate, O=P(OMe)s, are preferred. Likewise, triethoxyarsine, As(OEt)3, and 
triethylarsenate, O=As(OEt)3, have been employed for AsSG growth. 


The co-reaction of TEOS with organoboron and organophosphorous compounds allows for 
deposition at lower temperatures (500 - 650 °C) than for hydride growth of comparable rates. 
However, LPCVD, using an all organometallic approach, requires P?(OMe)3 because the low 
reactivity of O=P(OMe)3 prevents significant phosphorus incorporation. Although premature 
decomposition of P}(OMe)3 occurs at 600 °C (leading to non-uniform growth), deposition at 550 
°C results in high film uniformity at reasonable deposition rates. 
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Chemical Vapor Deposition of Alumina 


Alumina 


Alumina, Al)O3, exists as multiple crystalline forms, however, the two most important are 
the a and y forms. a-Al,O3 (corundum) is stable at high temperatures and its structure 
consists of a hexagonal close-packed array of oxide (O*”) ions with the Al°* ions 
occupying octahedral interstices. In contrast, y-Al,O3 has a defect spinel structure, readily 
takes up water and dissolves in acid. Despite the potential disadvantages of y-Al,O3 there 
is a preference for its deposition on silicon substrates because of the two different lattice- 
matching relationships of y-Al)O3 (100) on Si(100). These are shown as schematic 
diagrams in [link]. A summary of CVD precursor systems for Al»O3 is given in [link]. 


i. 


y-Al203 


(a) (b) 


Schematic diagram of the crystallographic relations of y- 

Al,O3 on Si(100): (a) y-Al5O3 (100)||Si(100), and (b) y- 

Al5O3 (100)||Si(110). Adapted from A. R. Barron, CVD 
of Non-Metals, W. S. Rees, Jr., Ed. VCH, New York 


(1996). 
Aluminum Oxygen Carrier CVD Deposition 
"i Comments 
precursor source gas method temp. (°C) 
AICls CO>/H> H> or APCVD 700 - 900 amorphous 


No (700), 


AlMe3 


AlMe3 


AlMe3 


AlMe3 


AlMe3 


Al(O'Pr)3 


Al(O'Pr)3 


Al(O'Pr)3 


Al(acac)3 


Al(acac)3 


Al(acac)3 


O» 


Op 


N,O 


air 


O» and 
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No 
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He 
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Np 


Np 


Ar 


APCVD 


LPCVD 
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LPCVD 


PECVD 


APCVD 


LPCVD 


LPCVD 


APCVD 


APCVD 


LPCVD 
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375 
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950 - 1050 


120 - 300 


420 - 600 


250 - 450 
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420 - 450 


250 - 600 
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crystalline 
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dep. rate 
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dependent 
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and O» 
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enhanced, 
10 W 
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quality 
than with 
Oz 
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passivation 
properties 
of Si MOS 
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enhanced, 


epitaxial 
on Si 


high C 
content 


significant 
C content 


growth rate 
indep. of 


H,O but 
film 
quality 
dep. on 
H,O 


Precursors and deposition conditions for Al,O3 CVD. 


CVD from halides 


The initial use of CO>/Hp> as a hydrolysis source for the CVD of SiO> from SiCly, led to the 
analogous deposition of Al»O3 from AICla, i.e., 
Equation: 


H, + CO, > H,O + CO 


Deposition in the temperature range 700 - 900 °C was found to yield films with optimum 
dielectric properties, but films deposited below 700 °C contained significant chloride 
impurities. It has been determined that H2O vapor, formed from Hy and COsy, acts as the 
oxygen donor, and not the CO). The crystal form of the CVD-grown alumina films was 
found to depend on the deposition temperature; films grown below 900 °C were y-Al)O3, 
while those grown at 1200 °C were a-Al,O3, in accord with the known phase diagram for 
this material. 


CVD from trimethylaluminum (TMA) 


Although trimethylaluminum, AlMe3 (TMA), reacts rapidly with water to yield Al)Os, the 
reaction is highly exothermic (-1243 kJ/mol) and thus difficult to control. The oxygen 
gettering properties of aluminum metal, however, can be employed in the controlled 
MOCVD growth of Al»O3. The common deposition conditions employed for CVD of 
Al,O3 from AlMe3 are similar to those used for aluminum-metal CVD, but with the 
addition of an oxygen source, either O> or N50. 


Films grown by APCVD using N>O are of inferior quality to those employing O>, due to 
their exhibiting some optical absorption in the visible wavelength region. The growth of 
high quality films using either oxygen source is highly dependent on the gas phase 
concentrations of aluminum and “oxygen”. Further improvements in film quality are 
observed with the use of a temperature gradient in the chambers deposition zone. 


Attempts to lower the deposition temperature employing PECVD have been generally 
successful. However, a detailed spectroscopic study showed that the use of NO as the 
oxygen source resulted in significant carbon and hydrogen incorporation at low 
temperatures (120 - 300 °C). The carbon and hydrogen contamination are lowered at high 
deposition temperature, and completely removed by a post-deposition treatment under O>. 
It was proposed that the carbon incorporated in the films is in the chemical form of Al-CH3 
or Al-C(O)OH, while hydrogen exists as Al-OH moieties within the film. 


Photo-assisted CVD of Al,O3 from AlMe3 has been reported to provide very high growth 
rates (2000 A/min) and give films with electrical properties comparable to films deposited 
using thermal or plasma techniques. Irradiation with a 248 nm (KrF) laser source allowed 
for uniform deposition across a 3" wafer. However, use of 193 nm (ArF) irradiation 
required dilution of the AlMe3 concentration to avoid non-uniform film growth. 


CVD from alkoxides and B-diketonates 


The pyrophoric nature of AlMe3 urged investigations into alternative precursors, in 
particular those which already contain oxygen. Alternative precursors might also provide 
possible routes to eliminate carbon contamination. Given the successful use of TEOS in 
SiO, thin film growth, an analogous alkoxide precursor approach is logical. The first report 
of Al,O3 films grown by CVD used an aluminum alkoxide precursors. 


Aluminum tris-iso-propoxide, Al(O'Pr)3, is a commercially available inexpensive alkoxide 
precursor compound. Deposition may be carried-out by either APCVD or LPCVD, using 
oxygen as an additional oxidation source to ensure low carbon contamination. It is 
adventitious to use LPCVD (10 Torr) growth to inhibit gas phase homogeneous reactions, 
causing formation of a powdery deposit. The use of lower chamber pressures (3 Torr) and 
N>O as the oxide source provided sufficient improvement in film quality to allow for 
device fabrication. 


The deposition of Al,O3 films from the pyrolysis of aluminum acetylacetonate, Al(acac)3 
([link]a), has been widely investigated using both APCVD and LPCVD. The perceived 
advantage of Al(acac)3 over other aluminum precursors includes lowered-toxicity, good 
stability at room temperature, easy handling, high volatility at elevated temperatures, and 
low cost. However, the quality of films was originally poor; carbon being the main 
contaminant resulting from the thermolysis and incorporation of acetone and carbon 
dioxide formed upon thermal decomposition ((link]). 


(a) (b) (c) 


Aluminum £-diketonate precursors. 


acetone 


200 250 300 350 400 
Temperature of Pyrolysis (°C) 


Gaseous decomposition products from the 
pyrolysis of Al(acac)3 as a function of 
pyrolysis temperature (Data from J. Von 
Hoene, R. G. Charles, and W. M. Hickam, J. 
Phys. Chem., 1958, 62, 1098). 


Incomplete oxidation of the film may be readily solved by the addition of water vapor to 
the carrier gas stream; pure carbon-free films being grown at temperatures as low as 230 
°C. In fact, water vapor plays an important role in the film growth kinetics, film purity, and 
the surface morphology of the grown films. While the growth rate is unaffected by the 
addition of water vapor, its influence on the surface morphology is significant. Films 
grown without water vapor on the Al,O3 surface is rough with particulates. In contrast, 
films grown with water vapor are mirror smooth. 


A systematic study of the kinetics of vaporization of Al(acac)3 along with fluorinated 
aluminum f-diketonate complexes, Al(tfac)3 ([link]b) and Al(hfac)s ([link]c), has been 
reported, and the saturation vapor pressures determined at 75 - 175 °C. 


Aluminum silicates 


The high dielectric constant, chemical stability and refractory character of aluminosilicates, 
(Al,03),(SiO2)y, makes them useful as packaging materials in IC chip manufacture. 
Mullite (3A1,03.2SiO>) prepared by sol-gel techniques, is often used as an encapsulant for 
active devices and thin-film components. Amorphous alumina-silica films have also been 
proposed as insulators in multilevel interconnections, since they do not suffer the 
temperature instability of alumina films retain the desirable insulating characteristics. 
Under certain conditions of growth and fabrication, silica may crystallize, thereby allowing 
diffusion of oxygen and impurities along grain boundaries to the silicon substrate 
underneath. Such unwanted reactions are catastrophic to the electronic properties of the 
device. The retention of amorphous structure over a larger temperature range of silicon rich 
alumina-silica films offers a possible solution to this deleterious diffusion. 


Thin films of mixed metal oxides are usually obtained from a mixture of two different 
kinds of alkoxide precursors. However, this method suffers from problems with 
stoichiometry control since extensive efforts must be made to control the vapor phase 
concentration of two precursors with often dissimilar vapor pressures. Also of import here 
is the near impossible task of matching rates of hydrolysis/oxidation to give "pure", non- 
phase segregated films, i.e., those having a homogeneous composition and structure. In an 
effort to solve these problems, research effort has been aimed at single-source precursors, 
i.e., those containing both aluminum and silicon. 


The first study of single-source precursors for (Al203),(SiO2), films employed the mono- 
siloxide complex Al(O'Pr):(OSiMes) ([link]a). However, it was found that except for 
deposition at very high temperatures (> 900 °C) the deposited films this mono-siloxide 
compound were aluminum-rich (AI/Si = 1.3 - 2.1) and thus showed thermal instability in 
the insulating properties caused by crystallization in the films. It would appear that in order 
for silicon-rich alumina-silica films to be grown more siloxane substituents are required, 
e.g., the tris-siloxy aluminum complex [Al(OSiEt3)3]» ([link]b). 


SiEt, SiEt; 


'PrOn., Ven 1 \O'Pr EnSiOv..,/” Ng 


ae a Sas pnOSiEt, 
iPpror WA ~oPr Sore NZ Osi 
SiEt, SiEt, 
(a) (b) 


Precursors for aluminum silicate thin films. 


The AI/Si ratio of thin films growth by APCVD using [AI(OSiEt3)3]5 at 420 - 550 °C, was 
found to be dependent on the deposition temperature and the carrier gas composition 
(O>/Ar). This temperature and oxygen-dependent variation in the film composition 
suggests that two competing precursor decomposition pathways are present. 


1. Deposition in the absence of Oy, is similar to that observed for the decomposition of 
Al(O'Pr),(OSiMe3) under Np, and would imply that the film composition is 
determined by the temperature-dependent tendencies of the Al-O-Si bonds to cleave. 

2. The temperature-independent oxidative decomposition of the precursor. While it is 
possible to prepare films richer in Si using [Al(OSiEt3)3]> rather than 
Al(O'Pr)2(OSiMes), the Al:Si ratio is unfortunately not easily controlled simply by the 
number of siloxy ligands per aluminum in the precursor. 


Films grown from the single-source precursor Al(O'Pr)»(OSiMe;) crystallize to kyanite, 
Al)SiOs, whereas those grown from [Al(OSiEt3)3]> remained amorphous even after 
annealing. 
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Introduction to Nitride Chemical Vapor Deposition 


The refractory nature and high dielectric properties of many nitrides make 
them attractive for chemical and electronic passivation. As a consequence 
silicon nitride has become the standard within the semiconductor industry, 
as both an encapsulation layer and as an etch mask. 


In a similar manner to oxide growth by chemical vapor deposition (CVD), 
two sources are generally required for binary nitride CVD: the element of 
choice and a nitrogen source. However, unlike the CVD of oxides, 
elemental nitrogen (N>) is not reactive, even at elevated temperatures, 
thereby requiring plasma enhancement. Even with plasma enhanced CVD 
(PECVD), N> does not yield high quality films. As a substitute for N>, 
ammonia (NH3) has found general acceptance as a suitable nitrogen source. 
It is a gas, readily purified and cheap, however, it is of low reactivity at low 
temperatures. PECVD has therefore found favor for low temperature NH3- 
based precursor systems. 


Recent attempts to lower deposition temperatures have included the use of 
more reactive sources (e.g., Hy NNH>) and precursors containing nitrogen as 
a coordinated ligand. Probably the most important discovery with respect to 
nitride deposition is the use of a transamination reaction between amido 
compounds and ammonia ((link]). 

Equation: 


M—NR, + NH; —————» M—NH, + HNR, 
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Chemical Vapor Deposition of Silicon Nitride and Oxynitride 


Introduction 


Stoichiometric silicon nitride (SizN,4) is used for chemical passivation and encapsulation of 
silicon bipolar and metal oxide semiconductor (MOS) devices, because of its extremely good 
barrier properties for water and sodium ion diffusion. Water causes device metallization to 
corrode, and sodium causes devices to become electrically unstable. Silicon nitride is also used 
as a mask for the selective oxidation of silicon, and as a strong dielectric in MNOS (metal- 
nitride-oxide-silicon) structures. 


The use of ion implantation for the formation of active layers in GaAs MESFET devices ([link]) 
allow for control of the active layer thickness and doping density. Since implantation causes 
structural disorder, the crystal lattice of the GaAs must be subjected to a post implantation rapid 
thermal anneal step to repair the damage and to activate the implanted species. The required 
annealing temperature (> 800 °C) is higher than the temperature at which GaAs decomposes. 
Silicon nitride encapsulation is used to prevent such dissociation. Silicon nitride is also used for 
the final encapsulation of GaAs MESFET devices ((link]). 


Source Gate Drain 
contact contact contact 


Contact metal 


Gate layer 
Buffer layer 


Schematic diagrams of a GaAs metal-semiconductor 
field effect transistor (MESFET). Adapted from A. R. 
Barron, in CVD of Nonmetals, Ed. W. S. Rees, Jr., Wiley, 
NY (1996). 


The deposition of Si3N, is a broadly practiced industrial process using either grown by low 
pressure CVD (LPCVD) or plasma enhanced CVD (PECVD) with comparable properties for the 
grown films ([Llink]). 
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A summary of some typical CVD systems for silicon nitride is given in [Link]. 
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Summary of the properties of silicon nitride grown in typical commercial systems. 


One of the disadvantages of Si3N, is its high dielectric constant that may limit device speed at 
higher operating frequencies. It is hoped that silicon oxynitride (SiON) films will exhibit the best 
properties of Si3N4 and SiO», namely the passivation and mechanical properties of Si3N, and the 
low dielectric constant and low stress of SiO>. 
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Precursors and deposition conditions for SizN4 CVD. 


CVD of silicon nitride from hydrides and chlorides 


The first commercial growth of silicon nitride was by the reaction of SiH, and NH3 by either 
atmospheric pressure CVD (APCVD) or PECVD. Film growth using APCVD is slower and 
requires higher temperatures and so it has been generally supplanted by plasma growth, however, 
film quality for APCVD is higher due to the lower hydrogen content. While thermally grown 
films are close to stoichiometric, PECVD films have a composition in which the S/N ratio is 
observed to vary from 0.7 - 1.1. The non-stoichiometric nature of PECVD films is explained by 
the incorporation of significant hydrogen in the films (10 - 30%). PECVD of SiN, using SiH,/N> 
leads to electronically leaky films due to the porous nature of the films, however, if an electron 
cyclotron resonance (ECR) plasma is employed, SiNx films of high quality may be deposited on 
ambient temperature substrates. 


The more recent commercial methods for silicon nitride deposition involves LPCVD using 
SiCl)H) as the silicon source in combination with NH3 at 700 - 900 °C. The reduced pressure of 
LPCVD has the advantages of high purity, low hydrogen content, stoichiometric films, with a 
high degree of uniformity, and a high wafer throughput. It is for these reasons that LPCVD is 
now the method of choice in commercial systems. A large excess of NH3 is therefore used in 
commercial systems to obtain stoichiometric films. Silicon nitride has also been prepared from 
SiCl4/NHy, SiBr4/NH3, and, more recently, SipCl¢/NH3. 


Silicon oxynitride (SiON) may be prepared by the use of any of the precursors used for silicon 
nitride with the addition of either N»O or NO as an oxygen source. The composition and 
properties of the SiO,N, films may be varied from SiOQ>-like to SigN,-like by the variation of the 
reactant flow rates. 


SiCl)H> gas plumbing to a LPCVD reactor must be thermally insulated to prevent condensation 
that would otherwise lead to hazy deposits on the film. The volatile by-products from CVD 
produce NH,Cl at the exhaust of the reaction tube, and in the plumbing and pumping system. It 
would be desirable, therefore, to find an alternative, chlorine-free silicon source with none of the 
toxicity or pyrophoricity problems associated with SiHy,. It is for this reason that organosilicon 
compounds have been investigated. 


CVD from organosilicon precursors 


Diethylsilane, EtpSiHz, has shown promise as a replacement for SiH, in the low temperature 
LPCVD of SiOs, and has been investigated as a source for SiN, and SiON, films. Deposition by 
LPCVD in the presence of NH3 produces SiN, films, in which the carbon contamination (4 - 
9%) depends on the partial pressure of the EtySiH». The presence of carbon raises the refractive 
index (2.025 - 2.28) with respect to traditional LPCVD films (2.01). Mixtures of EtpSiH2, NH3, 
and NO deposit SiO,N, films where the composition is controlled by the NH3:N,O ratio. 


CVD from silicon-nitrogen compounds 


The incorporation of carbon into silicon nitride films is a persistent problem of organosilicon 
precursors. Several studies have been aimed at developing single source precursors containing a 
Si-N bond rather than Si-C bonds. Polyazidosilanes, R,Si(N3)4_,, are low in carbon and 
hydrogen, reasonably volatile, and contain highly activated nitrogen, however, they represent a 
significant explosive hazard: they are explosive with an equivalent force to TNT. Films deposited 
using EtSi(N3)3 and (tBu)Si(N3)3 showed promise, despite the observation of oxygen and 
carbon. Pyrolytic studies on the azide precursors suggest that the primary decomposition step is 
the loss of dinitrogen, which is followed by migration of the alkyl onto the remaining nitrogen, 
[link]. The fact that neither the addition of NH3 or H> influence the film deposition rate suggest 
that the intramolecular nitride formation process is fast, relative to reaction with NH3, or 


hydrogenation. 
Equation: 
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Carbon incorporation is also observed for the APCVD deposition from Si(NMe>),H4_, (n = 2 - 
4). However, using the Hoffman transamination reaction, deposition in the presence of NH3 
completely removed carbon incorporation into the stoichiometric Si3N, film. From FTIR data, 
the hydrogen content was estimated to be 8 - 10 atom percent. While the Si(NMe>),H4_,/NH3 
system does not provide substantially lower temperatures than APCVD using SiH,/NH3 growth 
rates are significantly higher. Unlike the azide precursors, Si(NMe>),H4_, are easier to handle 
than either SiH, or SiCljH>. 
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Chemical Vapor Deposition of Aluminum Nitride 


Introduction 


Aluminum nitride (AIN) has potential for significant applications in microelectronic and 
optical devices. It has a large direct bandgap (Eg qj = 6.28 eV), extremely high melting point 
(3000 °C), high thermal conductivity (2.6 W/cm.K), and a large dielectric constant (€ = 9.14). 
In present commercial microelectronic devices, AIN is used most often as a packaging 
material, allowing for the construction of complex packages with many signal, ground, power, 
bonding, and sealing layers. Aluminum nitride is especially useful for high power applications 
due to its enhanced thermal conductivity. Chemical vapor deposition (CVD) grown thin films 
of AIN have been centered upon its use as a high gate-insulation layer for MIS devices, and a 
dielectric in high-performance capacitors. One additional property of AIN that makes it a 
promising insulating material for both Si and GaAs devices is that its thermal expansion 
coefficient is almost identical to both of these semiconductors. 


The lack of a suitably volatile homoleptic hydride for aluminum (A1Hs is an involatile 
polymeric species) led to the application of aluminum halides and organometallic compounds 
as precursors. A summary of selected precursor combinations is given in [link]. 


Aluminum Nitrogen Carrier CVD Deposition 


precursor source gas method temp. (°C) Sour 
AICl3(NH3) - Np LPCVD 700 - 1400 ees 
present 
AIBrs3 NH3 N> APCVD 400 - 900 Br present 
AIBr3 N5 N5 LPCVD 520-560 oe 
growth 
AlMe3 NH; H> LPCVD 1200 
AlMe3 NH3 He APCVD 350 - 400 
N-H and 
ae AIN-N 
AlMe3 cracked H,/He APCVD 310 - 460 
bonds 
NH; 
detected 


AlMe, ‘BuNH, Hp APCVD 400 - 600 high C 


or content, 
*PrNH> low N 


AlMe3 Me3SiN3 Hy APCVD 300 - 450 pe 
poor film 
quality, 
high C 
content 


[R2Al(NH>)]3 


ean H> LPCVD 400 - 800 


unreacted 
[RoAIN3]3 (R - LPCVD 400 - 500 precursor 
= Me, Et) present on 
film 


amorphous 
100 - 200 
dO 
crystalline 
300 - 500 
°C 


Al(NMe;)3 NH; He APCVD 100-500 


Precursors and deposition conditions for AIN CVD. 


CVD from halides 


The observation that AIN powder may be produced upon the thermal decomposition of the 
AICl3(NH3) complex, prompted initial studies on the use of AlCl3/NH3 for the CVD of AIN 
films. Initially, the low volatility of AlCl; (a polymeric chain structure) required that the 
AICl3(NH3) complex to be used as a single precursor. Low pressure CVD (LPCVD) at 5 -10 
Torr resulted in deposition of AIN films, although films deposited below 1000 °C were 
contaminated with NH,Cl, and all the films contained chlorine. Films with reasonable 
electrical properties were prepared by the use of the more volatile tris-ammonia complex, 
AICIl3(NH3)3. The dielectric constant for films grown at 800 - 1000 °C (11.5) is higher than 
bulk AIN (9.14) and also than that of the films grown at 1100 °C (8.1). All the films were 
polycrystalline with the grain size increasing with increasing deposition temperatures and 
preferred orientation was observed only for the films grown below 1000 °C. 


Aluminum bromide is a dimeric volatile compound, [BrjAl(p-Br)]>, and is more attractive as a 
CVD source, than AICl3. Deposition of AIN films can be accomplished using AlBr3 and NH3 
in an APCVD system with Hp as the carrier gas. The mechanism of film growth has been 
proposed ([link]). 


AIBr(g) + NH3(g) 


surface adsorption 
homogeneous surface reaction 


reaction AIBr,.NH3(surface) ——————————_»_ AIN 


decomposition 
of intermediate film 
surface adsorption compounds 


AIBr,(NH3 )(g) 
Mechanism of APCVD film growth of AIN using AIBrs 
and NH3. 


Due to the high temperatures required (750 °C) for good quality AIN film growth from AIBrs, 
PECVD was investigated. Using an AlBr3-H»-N> gas mixture and a 2450 MHz microwave 
(100 - 1000 W) plasma source, AIN films were grown. The maximum deposition rate occurred 
with an N>/AIBr3 ratio of ca. 20 and a substrate temperature ca. 600 °C. 


CVD from aluminum alkyls 


Based upon the successful metal organic CVD (MOCVD) growth of AlGaAs using the alkyl 
derivatives, AIR3, it was logical to extend MOCVD to aluminum nitride. Initial studies were 
performed using AlMe3 and NH3 with H) carrier gas. While these films are generally of high 
quality, the temperature of deposition is incompatible with semiconductor processing (being 
above both the melting point of most metallization alloys and the temperature at which dopant 
migration becomes deleterious). Lower temperatures (as low as 350 °C) were explored, 
however significant pre-reaction was observed between AlMe3 and NH3; causing depletion of 
the reactants in the deposition zone, reducing the growth rate and leading to non-uniform 
deposits. Two routes have been investigated by which this problem can be circumvented. 


PECVD successfully lowers the deposition temperature, although, degradation of the substrate 
surface by ion bombardment is a significant drawback. Given that it is the ammonia 
decomposition that represents the highest energy process, pre-cracking should lower the 
overall deposition temperature. This is indeed observed for the AlMe3/NH3-based AIN system 
where growth is achieved as low as 584 °C if the NHs3 is catalytically cracked over a heated 
tungsten filament (1747 °C). In fact, with catalytic pre-cracking, deposition rates were 
observed to be an order of magnitude greater than for PECVD at the same temperatures, 
resulting in films that were crystalline with columnar growth. For this approach to low- 
temperature MOCVD growth of AIN the only major drawback is the presence of residual N-H 
and AIN-N groups detected by FT-IR. 


Chemical solutions to the high stability of NH3 have primarily centered upon the use of 
alternative nitrogen sources. The use of the volatile nitrogen source hydrazine (N>Hy), has 
allowed for the growth of AIN at temperatures as low as 220 °C, however, hydrazine is 
extremely toxic and highly unstable, restricting its commercial application. Primary amines, 
such as ‘BuNH) or 'PrNHp, allow for deposition at modest temperatures (400 - 600 °C). The 


high carbon incorporation, as high as 17% precludes their adoption. A similar problem is 
observed with the use of trimethylsilylazide, Me3SiN3. The presence of carbon contamination 
in the deposition of Al films and AlGaAs epitaxial layers has been attributed to the use of 
AlMe3. Therefore attempts have been made to use alternative aluminum precursors. 


Interest in the mechanism of nucleation and atomic layer growth of AIN has prompted several 
mechanistic studies of the formation of Al-N bonds on the growth surface. All the studies 
concurred that the mechanism involves a step-wise reaction where the amide (-NH>-) groups 
form covalent bonds to aluminum irrespective of substrate. A schematic representation of the 
process is shown in [link]. 


H3C. NH. NH. 

ee ee a ab 
4: 4: 4: fc 

O oO oO Oo —_—_——_ 
ie loetiog ao ies eee 
Si Si Si Si i j i i 


cata a 


AL 2 AL ¢s Al 
VS ONS 
2 
}): ‘Al - 

= £3 + AI(CH3)3 


fodody Aa fo fe 9 
CELLET ZECcAZZ--=- ZZ 


ga 


A schematic representation of the proposed step- 
wise reaction involving the formation of amide (- 
NH)-) groups covalently bound to aluminum 
during the MOCVD growth of AIN using 
AlMe;3/NHs3. (Adapted from M. E. Bartram, T. A. 
Michalske, J. W. Rogers, Jr., and R. T. Paine, 
Chem. Mater., 1993, 5, 1424). 


CVD from aluminum amide and related compounds 


The reaction between aluminum alkyls and amines ([link]), as well as the formation of AIN 
powders from the pyrolysis of AlMe3(NHs3) ([link]), lead to the misguided concept that the 
route to high-purity AIN would be through the so-called single source precursor route. 
Equation: 


AIR, + HNR', —4> 1/[R,AI(NR')], + RH 


Equation: 


AlMe,(NH;) > AIN + 3 MeH 


The trimeric dimethylaluminum amide, [Me,Al(NH>)]3 ([link]a), was originally used as a 
single source precursor for growth of AIN under LPCVD conditions using a hot walled 
reactor, although subsequent deposition was also demonstrated in a cold walled system. Film 
quality was never demonstrated for electronic applications, but the films showed promise as 
fiber coatings for composites. The concept of using a trimeric single source precursor for AIN 
was derived from the observation of Al3N3 cycles as the smallest structural fragment in 
wurtzite AIN. However, detailed mechanistic studies indicate that under gas phase thermolysis 
the trimeric precursor [Me Al(NH2)]3 is in equilibrium with (or decomposes to) dimeric 
({link]b) and monomeric ({link]c) compounds. Furthermore, nitrogen-poor species ({link]d) 
were also observed by TOF-mass spectrometry. 
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The trimeric dimethylaluminum amide (a) 
used as a single source precursor for growth 
of AIN, and the decomposition products (b - 

d) observed by TOF-mass spectrometry. 


Following the early reports of single source precursor routes, a wide range of compounds have 
been investigated, including [Al(NR>)3]2, [HAI(NR»)212 (R = Me, Et), and [Me,AIN(Pr)9]p, 
all of which gave AIN, but none of these precursors give films of superior quality comparable 
to that obtained from traditional CVD. In particular, the films contained significant carbon 
contamination, prompting further investigations into the efficacy of, N-C bond free, 
dialkylaluminum azides, [R>AI(N3)]3, as LPCVD precursors. 


While aluminum tris-amides, Al(NR>)3 were shown to give carbon-contaminated films, 
APCVD carried-out with NH3 as the carrier gas results in carbon-free AIN film growth as low 
as 100 °C. The reason for the deposition of high quality films at such low temperatures resides 


with the Hoffman transamination reaction between the primary amido unit and ammonia. The 
crystallinity, bandgap and refractive index for the AIN grown by APCVD using [Al(NMe3)3]. 
and NHg3 are dependent on the deposition temperature. Films grown at 100 - 200 °C are 
amorphous and have a low bandgap and low refractive index. Above 300 °C, the films are 
crystalline, and have a refractive index close to that of bulk AIN (1.99 - 2.02), with a bandgap 
(< 5.77 eV) approaching the values reported for polycrystalline AIN (5.8 - 5.9 eV). 
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Metal Organic Chemical Vapor Deposition of Calcium Fluoride 


The chemical vapor deposition (CVD) of metal fluorides has been much 
less studied than that of oxides, pnictides, or chalgogenides. As may be 
expected where a volatile fluoride precursor is available then suitable films 
may be grown. For example, Group 5 (V, Nb, Ta), 6 (Mo, W), and 7 (Re) 
transition metals are readily deposited from fluoride-hydrogen mixtures. 
While the use of fluorine is discouraged on safety grounds, many of the 
fluorinated alkoxide or B-diketonate ligands employed for metal oxide 
metal organic chemical vapor deposition (MOCVD) are predisposed to 
depositing metal fluorides. The use of fluorine substituted derivatives is 
because they are often more volatile than their hydrocarbon analogs, and 
therefore readily used for both atmospheric and low pressure CVD. To 
minimize the unwanted formation of metal fluorides, water vapor is 
incorporated in the gas stream, and it is common to perform post-deposition 
hydrolytic anneals. However, there exist a number of applications where 
fluorides are required. For example, the highly insulating nature of CaF 
and SrF> has prompted investigations into their use as a gate insulator in 
GaAs-based metal insulator semiconductor field effect transistor (MISFET) 
devices. It should be noted that while CaF) is a good insulator, the 
CaF>/GaAs interface has a high interface trap density, requiring a 
passivation buffer layer to be deposited on GaAs prior to CaF» growth. 


One of the difficulties with the use of CaF, (and SrF>) on GaAs is the lattice 
mismatch ([link]), but this may be minimized by the use of solid solutions 
between CaF -SrF>. The composition Cag 44Sro.5¢F 2 is almost perfectly 
lattice-matched to GaAs. Unfortunately, the thermal expansion coefficient 
differences between GaAs and CaF>-SrF> produce strains at the 
film/substrate interface under high temperature growth conditions. The 
solution to this latter problem lies in the low temperature deposition of 
CaF>-SrF) by CVD. 


Compound Lattice constant (A) 


CaF, 5.46 


SrF> 5.86 
BaF»> 6.20 
GaAs 5.6532 


Lattice parameters of Group 2 (II) fluorides in comparison with GaAs. 


Polycrystalline CaF may be grown by the pyrolytic decomposition of 
Ca(CsMes)> ([link]a) in either SiF, or NF3. Deposition at 150 °C results in 
polycrystalline films with high levels of carbon (18%) and oxygen (7%) 
impurities limiting the films usefulness in electronic applications. However, 
significantly higher purity films may be grown at 100 °C using the photo- 
assisted decomposition of Ca(hfac), ({link]b). These films were deposited at 
30 A/min and showed a high degree of crystallographic preferred 
orientation. 
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CaF», MOCVD precursors. 


The mechanism enabeling fluoride transfer to the metal (from the carbon of 
fluorinated alkoxide ligands) has been investigated. MOCVD employing 
[Na(OR,)]4 and Zr(OR¢)4 [ORs = OCH(CF3)5 and OCMe3_,(CF3),, n = 1 - 
3] gives NaF and ZrF, films, respectively, with volatile fluorocarbon side- 
products. Analysis of the organic side-products indicated that 


decomposition occurs by transfer of fluorine to the metal in conjunction 
with a 1,2-migration of a residual group on the alkoxide, to form a ketone 
({link]). The migration is increasingly facile in the order CF3 << CH3 < H. 
The initial M-F bond formation has been proposed to be as a consequence 
of the close MF agostic interactions observed for some fluoroalkoxide and 
fluoro-B-diketonates. 
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Proposed mechanism for the decomposition of 
fluorinated alkoxide compounds. (Adapted from J. 
A. Samuels, W. -C. Chiang, C. -P. Yu, E. Apen, D. 

C. Smith, D. V. Baxter, K. G. Caulton, Chem. 
Mater., 1994, 6, 1684). 
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Precursors for Chemical Vapor Deposition of Copper 


Note: This module was developed as part of the Rice University course CHEM- 
496: Chemistry of Electronic Materials. This module was prepared with the 
assistance of Wei Zhao. 


Introduction 


Chemical vapor deposition (CVD) is a process for depositing solid elements and 
compounds by reactions of gas-phase molecular precursors. Deposition of a 
majority of the solid elements and a large and ever-growing number of 
compounds is possible by CVD. 


Most metallization for microelectronics today is performed by the physical 
vapor deposition (PVD) processes of evaporation and sputtering, which are 
often conceptually and experimentally more straightforward than CVD. 
However, the increasing importance of CVD is due to a large degree to the 
advantages that it holds over physical vapor deposition. Foremost among these 
are the advantages of conformal coverage and selectivity. Sputtering and 
evaporation are by their nature line-of-sight deposition processes in which the 
substrate to be coated must be placed directly in front of the PVD source. In 
contrast, CVD allows any substrate to be coated that is in a region of sufficient 
precursor partial pressure. This allows the uniform coating of several substrate 
wafers at once, of both sides of a substrate wafer, or of a substrate of large size 
and/or complex shape. The PVD techniques clearly will also deposit metal on 
any surface that is in line of sight. On the other hand, it is possible to deposit 
selectively on some substrate materials in the presence of others using CVD, 
because the deposition is controlled by the surface chemistry of the 
precursor/substrate pair. Thus, it may be possible, for example, to synthesize a 
CVD precursor that under certain conditions will deposit on metals but not on 
an insulating material such as SiO», and to exploit this selectivity, for example, 
in the fabrication of a very large-scale integrated (VLSI) circuit. It should also 
be pointed out that, unlike some PVD applications, CVD does not cause 
radiation damage of the substrate. 


Since the 1960s, there has been considerable interest in the application of metal 
CVD for thin-film deposition for metallization of integrated circuits. Research 
on the thermal CVD of copper is motivated by the fact that copper has physical 
properties that may make it superior to either tungsten or aluminum in certain 
microelectronics applications. The resistivity of copper (1.67 mW.cm) is much 
lower than that of tungsten (5.6 mW.cm) and significantly lower than that of 
aluminum (2.7 mW.cm). This immediately suggests that copper could be a 
superior material for making metal interconnects, especially in devices where 
relatively long interconnects are required. The electromigration resistance of 
copper is higher than that of aluminum by four orders of magnitude. Copper has 
increased resistance to stress-induced voidage due to its higher melting point 
versus aluminum. There are also reported advantages for copper related device 
performance such as greater speed and reduced cross talk and smaller RC time 
constants. On the whole, the combination of superior resistivity and 
intermediate reliability properties makes copper a promising material for many 
applications, provide that suitable CVD processes can be devised. 


Applications of metal CVD 


There are a number of potential microelectronic applications for metal CVD, 
including gate metallization (deposit on semiconductor), contact metallization 
(deposit on semiconductor), diffusion barrier metallization (deposit on 
semiconductor), interconnect metallization (deposit on insulator and conductor 
or semiconductor). Most of the relevant features of metal CVD are found in the 
interconnect and via fill applications, which we briefly describe here. There are 
basically two types of metal CVD processes that may occur: 


1. Blanket or nonselective deposition, in which deposition proceeds 
uniformly over a variety of surfaces. 

2. Selective deposition in which deposition only occurs on certain types of 
surfaces (usually semiconductors or conductors, but not insulators). 


A primary application of blanket metal CVD is for interconnects. The conformal 
nature of the CVD process is one of the key advantages of CVD over PVD and 
is a driving force for its research and development. The degree of conformality 
is usually described as the “step coverage”, which is normally defined as the 
ratio of the deposit thickness on the step sidewall to the deposit thickness on the 
top surface. Another application for blanket metal CVD is via hole filling to 


planarize each level for subsequent processing, This is achieved by depositing a 
conformal film and etching back to the insulator surface, leaving the metal 
“plug” intact. Another unique aspect of CVD is its potential to deposit films 
selectively, which would eliminate several processing steps required to perform 
the same task. The primary application for selective metal CVD would be for 
via hole filling. Ideally, deposition only occurs on exposed conductor or 
semiconductor surfaces, so filling of the via hole is achieved in a single step. 


Copper CVD 


The chemical vapor deposition of copper originally suffered from a lack of 
readily available copper compounds with the requisite properties to serve as 
CVD precursors. The successful development of a technologically useful copper 
CVD process requires first and foremost the design and synthesis of a copper 
precursor which is volatile, i.e., possesses an appreciable vapor pressure and 
vaporization rate to allow ease in transportation to the reaction zone and 
deposition at high growth rates. Its decomposition mechanism(s) should 
preferably be straightforward and lead to the formation of pure copper and 
volatile by-products that are nonreactive and can be cleanly removed from the 
reaction zone to prevent film, substrate, and reactor contamination. Gaseous or 
liquid sources are preferred to solid sources to avoid undesirable variations in 
vaporization rates because of surface-area changes during evaporation of solid 
sources and to permit high levels of reproducibility and control in source 
delivery. Other desirable features in precursor selection include chemical and 
thermal stability to allow extended shelf life and ease in transport and handling, 
relative safety to minimize the industrial and environmental impact of 
processing and disposal, and low synthesis and production costs to ensure an 
economically viable process. 


Several classes of inorganic and metalorganic sources have been explored as 
copper sources. Inorganic precursors for copper CVD used hydrogen reduction 
of copper halide sources of the type CuX or CuX», where X is chlorine (Cl) or 
fluorine (F): 


BCU tg 32 Gu 2 FLX 


CuX) + Hy - Cu+2 HX 


The volatility of copper halides is low, the reactions involved require 
prohibitively high temperatures (400 - 1200 °C), lead to the production of 
corrosive by-products such as hydrochloric and hydrofluoric acids (HCl and 
HF), and produce deposits with large concentrations of halide contaminants. 
Meanwhile, the exploration of metalorganic chemistries has involved various 
copper(II) and copper(I) source precursors, with significant advantages over 
inorganic precursors. 


From Cu(II) precursors 


Volatile Cu(I]) compounds 


Copper was known to form very few stable, volatile alkyl or carbonyl 
compounds. This was thought to eliminate the two major classes of compounds 
used in most existing processes for CVD of metals or compound 
semiconductors. Copper halides have been used for chemical vapor transport 
growth of Cu-containing semiconductor crystals. But the evaporation 
temperatures needed for copper halides are much higher than those needed for 
metal-organic compounds. Film purity and resistivity were also a problem, 
possibly reflecting the high reactivity of Si substrates with metal halides. 


Cu(II) compounds that have been studied as CVD precursors are listed in [link]. 
The structural formulas of these compounds are shown in [link] along with the 
ligand abbreviations in [link]. Each compound contains a central Cu(II) atom 
bonded to two singly charged f-diketonate or B-ketoiminate ligands. Most of 
them are stable, easy to synthesize, transport and handle. 


Evaporation Deposition Carrier | Reactor 
Suepoun temp. (°C) temp. (°C) as pressure 
‘i a (Torr) 
Cu(acac), 180 - 200 225 - 250 H/Ar 760 


Cu(hfac), 80 - 95 250 - 300 Hy 760 


Cu(tfac), 135 - 160 250 - 300 H> 760 


Cu(dpm), 100 400 none <10-4 

Cu(ppm)> 100 400 none <0.3 

Cu(fod), ‘ 300 - 400 Hp 10™ - 
760 

Cu(acim)> 287 400 Hp 730 

oer 85 - 105 270 - 350 He 10-70 

2 
Cu(acen)> 204 450 H> 730 


Studies of Cu CVD using Cu(II) compound. Adapted from T. Kodas and M. 
Hampden-Smith, The Chemistry of Metal CVD, VCH Publishers Inc., New York, 
NY (1994). 
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(a) (b) (c) 


Structures of Cu(II) compounds studied as CVD precursors. 


Ligand Structural 


abbreviation Pe 7 type 
acac CH3 CH3 a 
hfac CF3 CF3 a 
tfac CH3 CF3 a 
dpm C(CH3)3 C(CH3)3 a 
ppm C(CH3)3 CF »CF3 a 
fod C(CH3)3 CF )CF CF3 a 
acim CH3 H b 
nona-F CF3 CH»CF3 b 
acen CH3 - Cc 


Ligand abbreviations for the structures shown in [link]. 


Attention has focused on Cu(II) B-diketonate [i.e., Cu(tfac)., Cu(hfac).] and 
Cu(ID B-ketoiminate [i.e., Cu(acim), Cu(acen),]. An important characteristic of 
Cu(II) compounds as CVD precursors is the use of heavily fluorinated ligand 
such as Cu(tfac) and Cu(hfac)5 versus Cu(acac). The main effort of fluorine 
substitution is a significant increase in the volatility of the complex. 


Synthesis of Cu(II) precursors 


Cu(hfac))"nH)O (n = 0, 1, 2) 


Cu(hfac), is by far the most extensively studied of the Cu(II) CVD precursors. 
Preparations in aqueous solutions yield the yellow-green dihydrate, 
Cu(hfac) -2H,O. This is stable in very humid air or at lower temperatures but 


slowly loses one molecule of water under typical laboratory conditions to form 
the “grass-green” monohydrate, Cu(hfac)»-H»O. The monohydrate, which is 
commercially available, can be sublimed unchanged and melts at 133 — 136 °C. 
More vigorous drying over concentrated H»SO, produces the purple anhydrous 
compound Cu(hfac), (mp = 95 — 98 °C). The purple material is hydroscopic, 
converting readily into the monohydrate. Other B-diketonate Cu(II) complexes 
are prepared by the similar method. 


Schiff-base complexes 


Schiff-base complexes include Cu(acim),, Cu(acen) and Cu(nona-F)>. The first 
two of these can be prepared by mixing Cu(NH3),°* (aq) with the pure ligand 
and by adding freshly prepared solid Cu(OH)> to a solution of the ligand in 
acetone. The synthesis of Cu(nona-F),, on the other hand, involved two 
important developments: the introduction of the silyl enol ether route to the 
ligand and its conversion in-situ into the desired precursor. The new approach to 
the ligand was required because, in contrast to non-fluorinated b-diketonates, 
H(hfac) reacts with amines to produce salts. 


Reaction mechanism 


Starting from the experimental results, a list of possible steps for Cu CVD via 
H, reduction of Cu(II) compounds would include the followings, where removal 
of adsorbed ligand from the surface is believed to be the rate limiting step: 


Cu(IDL>(g) > Cu(s) + 2 L-(ads) 
H)(g) > 2 H(ads) 
L-(ads) + H(ads) — HL(g) 


where L represents any of the singly charged B-diketonate or B-ketoiminate 
ligands described before. This mechanism gives a clear explanation of the 
importance of hydrogen being present: in the absence of hydrogen, HL cannot 
desorb cleanly into the gas phase and ligand will tend to decompose on the 
surface, resulting in impurity incorporation into the growing film. The 
mechanism is also supported by the observation that the deposition reaction is 


enhanced by the addition of alcohol containing B-hydrogen to the reaction 
mixture. 


More recently, the focus has shifted to Cu(I) compounds including Cu(I) 
cyclopentadienyls and Cu(I) B-diketonate. The Cu(I) B-diketonate in particular 
show great promise as Cu CVD precursors and have superseded the Cu(II) B- 
diketonate as the best family of precursors currently available. 


From Cu(I) precursors 


Precursor design 


The Cu(I) compounds that have been investigated are described in [link]. These 
species can be broadly divided into two classes, CuX and XCuL,, where X is a 
uninegative ligand and L is a neutral Lewis base electron pair donor. The 
XCuL, class can be further subdivided according to the nature of X and L. 


Copper(I) Precursors 


CuX a Vv 
[Cu(OR)Iq a WN eke 
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Copper(I) precursors used for CVD. Adapted from T. Kodas and 
M. Hampden-Smith, The Chemistry of Metal CVD, VCH 


Publishers Inc., New York, NY (1994). 


Compounds of general formula CuX are likely to be oligomeric resulting in a 
relatively low vapor pressure. The presence of a neutral donor ligand, L, is 
likely to reduce the extent of oligomerization compared to CuX by occupying 
vacant coordination sites. Metal alkoxide compounds are expected to undergo 
thermal decomposition by cleavage of either M-O or O-C bonds. 


Organo-copper(I) compounds, RCuL, where R is alkyl, are thermally unstable, 
but cyclopentadienyl compounds are likely to be more robust due to the m- 
bonding of the cyclopentadieny] ligand to the copper center. At the same time, 
the cyclopentadieny] ligand is sterically demanding, occupies three coordination 
sites at the metal center, and thereby reduces the desire for oligomerization. In 
general, a cyclopentadieny] ligand is a poor choice to support CVD precursors, 
especially with electropositive metals, because this ligand is unlikely to be 
liable. Compounds in the family XCuL», where X is a halide and L is a 
triorganophosphine, exhibit relatively high volatility but are thermally stable 
with respect to formation of copper at low temperatures. These species are 
therefore suitable as products of etching reactions of copper films. 


A number of researchers have demonstrated the potential of a series of B- 
diketonate Cu(I) compounds, (f-diketonate)CuL,, where L is Lewis base and n 
= 1 or 2, that fulfill most of the criteria outlined for precursor design before. 
These species were chosen as copper precursors for the following reasons: 


e They contain the §-diketonate ligand which generally imparts volatility to 
metal-organic complexes, particularly when fluorinated, as a result of a 
reduction in hydrogen-bonding in the solid-state. 

e They are capable of systematic substitution through both the B-diketonate 
and Lewis base ligands to tailor volatility and reactivity. 

e Lewis bases such as phosphines, olefins and alkynes are unlikely to 
thermally decompose at temperatures where copper deposition occurs. 

e These precursors can deposit copper via thermally induced 
disproportionation reactions and no ligand decomposition is required since 
the volatile Lewis base the Cu(II) disproportionation products are 
transported out of the reactor intact at the disproportionation temperature. 


Reaction mechanism 


A general feature of the reactions of Cu(I) precursors is that they thermally 
disproportionate, a mechanism likely to be responsible for the high purity of the 
copper films observed since ligand decomposition does not occur. The 
disproportionation mechanism is shown in [link] for (B-diketonate)CuL. The 
unique capabilities of this class of compounds result from this reaction 
mechanism by which they deposit copper. This mechanism is based on the 
dissociative adsorption of the precursor to form Cu(hfac) and L, 
disproportionation to form Cu(hfac), and Cu and desorption of Cu(hfac), and L. 
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Schematic diagram of the disproportionation mechanism. Adapted 
from T. Kodas and M. Hampden-Smith, The Chemistry of Metal 
CVD, VCH Publishers Inc., New York, NY (1994). 
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Thus, the starting material acts as its own reducing agent and no external 
reducing agent such as H> is required. Another advantage of the Cu(I) B- 
diketonates over the Cu(II) B-diketonates is that in the former the ligand L can 
be varied systematically, allowing the synthesis of a whole series of different but 
closely related compounds. 


Selectivity 


Selectivity deposition has been studied in both hot- and cold-wall CVD reactors 
as a function of the nature of the substrate, the temperature of the substrate and 
the nature of the copper substituents. Selectivity has usually been evaluated by 
using Si substrates on which SiO, has been grown and patterned with various 
metals by either electron-beam deposition, CVD or sputtering. Research has 
suggested that selectivity on metallic surfaces is attributable to the biomolecular 
disproportionation reaction involved in precursor decomposition. 
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Rutherford Backscattering of Thin Films 


Introduction 


One of the main research interests of the semiconductor industry is to improve the 
performance of semiconducting devices and to construct new materials with reduced size or 
thickness that have potential application in transistors and microelectronic devices. 
However, the most significant challenge regarding thin film semiconductor materials is 
measurement. Properties such as the thickness, composition at the surface, and 
contamination, all are critical parameters of the thin films. To address these issues, we need 
an analytical technique which can measure accurately through the depth of the of the 
semiconductor surface without destruction of the material. Rutherford backscattering 
spectroscopy is a unique analysis method for this purpose. It can give us information 
regarding in-depth profiling in a non-destructive manner. However X-ray photo electron 
spectroscopy (XPS), energy dispersive X-ray analysis (EDX) and Auger electron 
spectroscopy are also able to study the depth-profile of semiconductor films. [link] 
demonstrates the comparison between those techniques with RBS. 


Method Hestuacave Incident Outgoing Detection Depth 


particle Particle limit resolution 

RBS No Ton Ion al 10 nm 
X-ray 

XPS Yes Electron ~0.1-1 ~1 pm 
photon 

EDX Yes Electron ale ~0.1 1.5 nm 

photon 
Auger Yes Electron Electron ~0.1-1 1.5 nm 


Comparison between different thin film analysis techniques. 


Basic concept of Rutherford backscattering spectroscopy 


At a basic level, RBS demonstrates the electrostatic repulsion between high energy incident 
ions and target nuclei. The specimen under study is bombarded with monoenergetic beam of 
4He* particles and the backscattered particles are detected by the detector-analysis system 
which measures the energies of the particles. During the collision, energy is transferred from 


the incident particle to the target specimen atoms; the change in energy of the scattered 
particle depends on the masses of incoming and target atoms. For an incident particle of 
mass My, the energy is Eg while the mass of the target atom is M>. After the collision, the 
residual energy E of the particle scattered at angle @ can be expressed as: 


E= k 2E, 


(sicoso + [M2 -M:?sin? @ 


Mit+Ma2 


k= 


where k is the kinematic scattering factor, which is actually the energy ratio of the particle 
before and after the collision. Since k depends on the masses of the incident particle and 
target atom and the scattering angle, the energy of the scattered particle is also determined 
by these three parameters. A simplified layout of backscattering experiment is shown in 
Figure 1. 
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Schematic representation of the experimental setup for 
Rutherford backscattering analysis. 


The probability of a scattering event can be described by the differential scattering cross 
section of a target atom for scattering an incoming particle through the angle @ into 
differential solid angle as follows, 


M1. 
doR | — [-os0+ Ft = Gagsinoy2|2 


dp  \2E0sin2@ 
1— eS sin@) 2 


where dog is the effective differential cross section for the scattering of a particle. The 
above equation may looks complicated but it conveys the message that the probability of 
scattering event can be expressed as a function of scattering cross section which is 
proportional to the zZ when a particle with charge ze approaches the target atom with 
charge Ze. 


Helium ions not scattered at the surface lose energy as they traverse the solid. They lose 
energy due to interaction with electrons in the target. After collision the He particles lose 
further energy on their way out to the detector. We need to know two quantities to measure 
the energy loss, the distance At that the particles penetrate into the target and the energy loss 
AE in this distance [link]. The rate of energy loss or stopping power is a critical component 
in backscattering experiments as it determines the depth profile in a given experiment. 


Depth 


Target 


Components of energy loss for a ion beam that 
scatters from depth t. First, incident beam loses 
energy through interaction with electrons AFjp. 
Then energy lost occurs due to scattering E,. 
Finally outgoing beam loses energy for interaction 
with electrons AE,,,. Adapted from L. C. Feldman 
and J. W. Mayer, Fundamentals of Surface and 
Thin Film Analysis , North Holland-Elsevier, New 
York (1986). 


In thin film analysis, it is convenient to assume that total energy loss AE into depth t is only 
proportional to t for a given target. This assumption allows a simple derivation of energy 


loss in backscattering as more complete analysis requires many numerical techniques. In 
constant dE/dx approximation, total energy loss becomes linearly related to depth t, [link]. 


Energy loss (AE) 


Thickness “1000 A 


Variation of energy loss with the depth of the target in 
constant dE/dx approximation. 


Experimental set-up 


The apparatus for Rutherford backscattering analysis of thin solid surface typically consist 
of three components: 


1. A source of helium ions. 
2. An accelerator to energize the helium ions. 
3. A detector to measure the energy of scattered ions. 


There are two types of accelerator/ion source available. In single stage accelerator, the He* 
source is placed within an insulating gas-filled tank ({link]). It is difficult to install new ion 
source when it is exhausted in this type of accelerator. Moreover, it is also difficult to 
achieve particles with energy much more than 1 MeV since it is difficult to apply high 
voltages in this type of system. 


Source 


Acceleration Tube | 
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Accelerated He* at 1 MeV 


Schematic representation of a single stage accelerator. 


Another variation is “tandem accelerator.” Here the ion source is at ground and produces 
negative ion. The positive terminal is located is at the center of the acceleration tube 
([link]). Initially the negative ion is accelerated from ground to terminal. At terminal two- 
electron stripping process converts the He to He**. The positive ions are further accelerated 
toward ground due to columbic repulsion from positive terminal. This arrangement can 
achieve highly accelerated He** ions (~ 2.25 MeV) with moderate voltage of 750 kV. 


Negative ion source 


Acceleration Tube 4 


Accelerated He* over 2 MeV 


Schematic representation of a tandem accelerator. 


Particles that are backscattered by surface atoms of the bombarded specimen are detected by 
a surface barrier detector. The surface barrier detector is a thin layer of p-type silicon on the 
n-type substrate resulting p-n junction. When the scattered ions exchange energy with the 
electrons on the surface of the detector upon reaching the detector, electrons get promoted 
from the valence band to the conduction band. Thus, each exchange of energy creates 
electron-hole pairs. The energy of scattered ions is detected by simply counting the number 
of electron-hole pairs. The energy resolution of the surface barrier detector in a standard 
RBS experiment is 12 - 20 keV. The surface barrier detector is generally set between 90° 
and 170° to the incident beam. Films are usually set normal to the incident beam. A simple 
layout is shown in [link]. 


Incident Beam 


Detector 


Scattered Beam 


Thin Film 


~165° 


Schematic representation general 
setup where the surface barrier 
detector is placed at angle of 165° to 
the extrapolated incident beam. 


Depth profile analysis 


As stated earlier, it is a good approximation in thin film analysis that the total energy loss 
AE is proportional to depth t. With this approximation, we can derive the relation between 
energy width AE of the signal from a film of thickness At as follows, 


AE = At(k dE/dx ;, + 1/cos@ dE/dx ou: ) 
where @ = lab scattering angle. 


It is worth noting that k is the kinematic factor defined in equation above and the subscripts 
“in” and “out” indicate the energies at which the rate of loss of energy or dE/dx is evaluated. 
As an example, we consider the backscattering spectrum, at scattering angle 170°, for 2 
MeV He" incidents on silicon layer deposited onto 2 mm thick niobium substrate [link]. 
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The backscattering spectrum for 2.0 MeV He ions incident on a 
silicon thin film deposited onto a niobium substrate. Adapted from 
P. D. Stupik, M. M. Donovan, A. R. Barron, T. R. Jervis and M. 
Nastasi, Thin Solid Films, 1992, 207, 138. 


The energy loss rate of incoming He** or dE/dx along inward path in elemental Si is *24.6 
eV/A at 2 MeV and is ©26 eV/A for the outgoing particle at 1.12 MeV (Since K of Si is 
0.56 when the scattering angle is 170°, energy of the outgoing particle would be equal to 2 x 
0.56 or 1.12 MeV) . Again the value of AE<; is ¥133.3 keV. Putting the values into above 
equation we get 

At © 133.3 keV/(0.56 * 24.6 eV/A + 1/cos 170° * 26 eV/A) 

= 133.3 keV/(13.77 eV/A + 29/.985 eV/A) 

= 133.3 keV/ 40.17 eV/A 

= 3318 A. 


Hence a Si layer of ca. 3300 A thickness has been deposited on the niobium substrate. 
However we need to remember that the value of dE/dx is approximated in this calculation. 


Quantitative Analysis 


In addition to depth profile analysis, we can study the composition of an element 
quantitatively by backscattering spectroscopy. The basic equation for quantitative analysis is 


Y=o0.Q.Q. NAt 


Where Y is the yield of scattered ions from a thin layer of thickness At, Q is the number of 
incident ions and Q is the detector solid angle, and NAt is the number of specimen atoms 
(atom/cm?). [link] shows the RBS spectrum for a sample of silicon deposited on a niobium 
substrate and subjected to laser mixing. The Nb has reacted with the silicon to form a NbSi, 
interphase layer. The Nb signal has broadened after the reaction as show in [link]. 
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Spectrum after the formation of niobium silicide 


Spectrum as Si deposited on Nb substrate 


100 ; 
NbSi_, layer on Nb | 


AN 


0.6 0.8 1.0 1.2 1.4 1.6 1.8 2.0 
Energy (MeV) 


Backscattering spectra of Si diffused into Nb and Si as deposited on 
Nb substrate. Adapted from P. D. Stupik, M. M. Donovan, A. R. 
Barron, T. R. Jervis and M. Nastasi, Thin Solid Films, 1992, 207, 

138, 


We can use ratio of the heights Hs;/Hyp of the backscattering spectrum after formation of 
NbSi, to determine the composition of the silicide layer. The stoichiometric ratio of Nb and 
Si can be approximated as, 


Ngi/Nnp © LHs; * si ]/LHNb * Onp] 


Hence the concentration of Si and Nb can be determined if we can know the appropriate 
cross sections 0s; and Oj. However the yield in the backscattering spectra is better 
represented as the product of signal height and the energy width AE. Thus stoichiometric 
ratio can be better approximated as 


Ngi/Nnp * [Hi * AEs; * osi]/LH np * AENp * Onp] 


Limitations 


It is of interest to understand the limitations of the backscattering technique in terms of the 
comparison with other thin film analysis technique such as AES, XPS and SIMS ([link]). 
AES has better mass resolution, lateral resolution and depth resolution than RBS. But AES 
suffers from sputtering artifacts. Compared to RBS, SIMS has better sensitivity. RBS does 
not provide any chemical bonding information which we can get from XPS. Again, 
sputtering artifact problems are also associated in XPS. The strength of RBS lies in 
quantitative analysis. However, conventional RBS systems cannot analyze ultrathin films 
since the depth resolution is only about 10 nm using surface barrier detector. 


Summary 


Rutherford Backscattering analysis is a straightforward technique to determine the thickness 
and composition of thin films (< 4000 A). Areas that have been lately explored are the use 
of backscattering technique in composition determination of new superconductor oxides; 
analysis of lattice mismatched epitaxial layers, and as a probe of thin film morphology and 
surface clustering. 
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The Application of VSI (Vertical Scanning Interferometry) to the Study of 
Crystal Surface Processes 


Introduction 


The processes which occur at the surfaces of crystals depend on many 
external and internal factors such as crystal structure and composition, 
conditions of a medium where the crystal surface exists and others. The 
appearance of a crystal surface is the result of complexity of interactions 
between the crystal surface and the environment. The mechanisms of 
surface processes such as dissolution or growth are studied by the physical 
chemistry of surfaces. There are a lot of computational techniques which 
allows us to predict the changing of surface morphology of different 
minerals which are influenced by different conditions such as temperature, 
pressure, pH and chemical composition of solution reacting with the 
surface. For example, Monte Carlo method is widely used to simulate the 
dissolution or growth of crystals. However, the theoretical models of 
surface processes need to be verified by natural observations. We can 
extract a lot of useful information about the surface processes through 
studying the changing of crystal surface structure under influence of 
environmental conditions. The changes in surface structure can be studied 
through the observation of crystal surface topography. The topography can 
be directly observed macroscopically or by using microscopic techniques. 
Microscopic observation allows us to study even very small changes and 
estimate the rate of processes by observing changing the crystal surface 
topography in time. 


Much laboratory worked under the reconstruction of surface changes and 
interpretation of dissolution and precipitation kinetics of crystals. Invention 
of AFM made possible to monitor changes of surface structure during 
dissolution or growth. However, to detect and quantify the results of 
dissolution processes or growth it is necessary to determine surface area 
changes over a significantly larger field of view than AFM can provide. 
More recently, vertical scanning interferometry (VSI) has been developed 
as new tool to distinguish and trace the reactive parts of crystal surfaces. 
VSI and AFM are complementary techniques and practically well suited to 
detect surface changes. 


VSI technique provides a method for quantification of surface topography 
at the angstrom to nanometer level. Time-dependent VSI measurements can 
be used to study the surface-normal retreat across crystal and other solid 
surfaces during dissolution process. Therefore, VSI can be used to directly 
and nondirectly measure mineral dissolution rates with high precision. 
Analogically, VSI can be used to study kinetics of crystal growth. 


Physical principles of optical interferometry 


Optical interferometry allows us to make extremely accurate measurements 
and has been used as a laboratory technique for almost a hundred years. 
Thomas Young observed interference of light and measured the wavelength 
of light in an experiment, performed around 1801. This experiment gave an 
evidence of Young's arguments for the wave model for light. The discovery 
of interference gave a basis to development of interferomertry techniques 
widely successfully used as in microscopic investigations, as in astronomic 
investigations. 


The physical principles of optical interferometry exploit the wave properties 
of light. Light can be thought as electromagnetic wave propagating through 
space. If we assume that we are dealing with a linearly polarized wave 
propagating in a vacuum in z direction, electric field E can be represented 
by a sinusoidal function of distance and time. 

Equation: 


E(x,y,z,t) = acos|2n(vt — z/A)| 


Where a is the amplitude of the light wave, v is the frequency, and J is its 
wavelength. The term within the square brackets is called the phase of the 
wave. Let’s rewrite this equation in more compact form, 

Equation: 


E(a,y,z,t) = acos[wt — kz] 


where w = 2rv is the circular frequency, and k = 2r/A is the propagation 
constant. Let’s also transform this second equation into a complex 


exponential form, 
Equation: 


E(x,y,z,t) = Re{aexp(id)exp(iwt)} = Re{ Aexp(iwt) } 


where @ = 2nz/X and A = exp(—i¢@) is known as the complex amplitude. 
If n is a refractive index of a medium where the light propagates, the light 
wave traverses a distance d in such a medium. The equivalent optical path 
in this case is 

Equation: 


p=n-d 


When two light waves are superposed, the result intensity at any point 
depends on whether reinforce or cancel each other ({link]). This is well 
known phenomenon of interference. We will assume that two waves are 
propagating in the same direction and are polarized with their field vectors 
in the same plane. We will also assume that they have the same frequency. 
The complex amplitude at any point in the interference pattern is then the 
sum of the complex amplitudes of the two waves, so that we can write, 
Equation: 


A=A,+ Ap. 


where A, = ayexp(—i¢,) and Az = a,exp(—id,) are the complex 
amplitudes of two waves. The resultant intensity is, therefore, 
Equation: 


L= | A |? =y+Io+ 2(I,I2)/*cosAd 


where J; and J2 are the intensities of two waves acting separately, and 
Ad = ¢1 — ¢¢ is the phase difference between them. If the two waves are 
derived from a common source, the phase difference corresponds to an 
optical path difference, 

Equation: 


Ap = (A/2n) Ad 


Two waves in phase 


Interfering waves coming 
from two point sources 


Two waves out of phase 


rt, 


The scheme of interferometric wave interaction when two 
waves interact with each other, the amplitude of resulting 
wave will increase or decrease. The value of this amplitude 
depends on phase difference between two original waves. 


If Ad, the phase difference between the beams, varies linearly across the 
field of view, the intensity varies cosinusoidally, giving rise to alternating 
light and dark bands or fringes ([link]). The intensity in an interference 
pattern has its maximum value 

Equation: 


Imax = Th + Ip + 2(IyI2)¥? 


when Ad = 2mn, where m is an integer and its minimum value 
Equation: 


Tin = Ty + Ty — 2(E, IQ)? 


when Ad = (2m + 1)z. 


The principle of interferometry is widely used to develop many types of 
interferometric set ups. One of the earliest set ups is Michelson 
interferometry. The idea of this interferometry is quite simple: interference 
fringes are produced by splitting a beam of monochromatic light so that one 
beam strikes a fixed mirror and the other a movable mirror. An interference 
pattern results when the reflected beams are brought back together. The 
Michelson interferometric scheme is shown in [link]. 


Fixed mirror 


Movable mirror 


Incident beam 


Detector 


Schematic representation of a Michelson interferometry 
set-up. 


The difference of path lengths between two beams is 2x because beams 
traverse the designated distances twice. The interference occurs when the 
path difference is equal to integer numbers of wavelengths, 

Equation: 


ApS 2s => mA,m= Oye 1, = 2a 


Modern interferometric systems are more complicated. Using special phase- 
measurement techniques they capable to perform much more accurate 
height measurements than can be obtained just by directly looking at the 
interference fringes and measuring how they depart from being straight and 
equally spaced. Typically interferometric system consist of lights source, 
beamsplitter, objective system, system of registration of signals and 
transformation into digital format and computer which process data. 
Vertical scanning interferometry is contains all these parts. [link] shows a 
configuration of VSI interferometric system. 
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Schematic representation of the Vertical scanning interferometry 
(VSI) system. 


Many of modern interferometric systems use Mirau objective in their 
constructions. Mireau objective is based on a Michelson interferometer. 
This objective consists of a lens, a reference mirror and a beamsplitter. The 
idea of getting interfering beams is simple: two beams (red lines) travel 
along the optical axis. Then they are reflected from the reference surface 
and the sample surface respectively (blue lines). After this these beams are 


recombined to interfere with each other. An illumination or light source 
system is used to direct light onto a sample surface through a cube beam 
splitter and the Mireau objective. The sample surface within the field of 
view of the objective is uniformly illuminated by those beams with different 
incidence angles. Any point on the sample surface can reflect those incident 
beams in the form of divergent cone. Similarly, the point on the reference 
symmetrical with that on the sample surface also reflects those illuminated 
beams in the same form. 


The Mireau objective directs the beams reflected of the reference and the 
sample surface onto a CCD (charge-coupled device) sensor through a tube 
lens. The CCD sensor is an analog shift register that enables the 
transportation of analog signals (electric charges) through successive stages 
(capacitors), controlled by a clock signal. The resulting interference fringe 
pattern is detected by CCD sensor and the corresponding signal is digitized 
by a frame grabber for further processing with a computer. 


The distance between a minimum and a maximum of the interferogram 
produced by two beams reflected from the reference and sample surface is 
known. That is, exactly half the wavelength of the light source. Therefore, 
with a simple interferogram the vertical resolution of the technique would 
be also limited to A/2. If we will use a laser light as a light source with a 
wavelength of 300 nm the resolution would be only 150 nm. This resolution 
is not good enough for a detailed near-atomic scale investigation of crystal 
surfaces. Fortunately, the vertical resolution of the technique can be 
improved significantly by moving either the reference or the sample by a 
fraction of the wavelength of the light. In this way, several interferograms 
are produced. Then they are all overlayed, and their phase shifts compared 
by the computer software [link]. This method is widely known as phase 
shift interferometry (PSI). 
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Sketch illustrating phase-shift technology. The sample is 
continuously moved along the vertical axes in order to scan surface 
topography. All interferograms are automatically overlayed using 
computer software. 


Most optical testing interferometers now use phase-shifting techniques not 
only because of high resolution but also because phase-shifting is a high 
accuracy rapid way of getting the interferogram information into the 
computer. Also usage of this technique makes the inherent noise in the data 
taking process very low. As the result in a good environment angstrom or 
sub-angstrom surface height measurements can be performed. As it was 
said above, in phase-shifting interferometry the phase difference between 
the interfering beams is changed at a constant rate as the detector is read 
out. Once the phase is determined across the interference field, the 


corresponding height distribution on the sample surface can be determined. 
The phase distribution @(x, y) is recorded by using the CCD camera. 


Let’s assign A(x, y), B(x, y), C(x, y) and D(x, y) to the resulting interference 
light intensities which are corresponded to phase-shifting steps of 0, 7/2, 7 
and 37/2. These intensities can be obtained by moving the reference mirror 
through displacements of 1/8, A/4 and 3//8, respectively. The equations for 
the resulting intensities would be: 


Equation: 

A(az,y) = I1(x,y) + Io(z,y)cosa(x,y) 
Equation: 

B(z,y) = Ii(2,y) — In(z,y)sina(z,y) 
Equation: 

C(z,y) = l(«,y) — In(x,y)cosa(x,y) 
Equation: 


D(a,y) = I1(x,y) + Io(x,y)sina(z,y) 


where I; (x,y)and I2(x,y) are two overlapping beams from two symmetric 
points on the test surface and the reference respectively. Solving equations 
[link]—[link], the phase map @(x, y) of a sample surface will be given by the 
relation: 

Equation: 


Once the phaseis determined across the interference field pixel by pixel on 
a two-dimensional CCD array, the local height distribution/contour, h(x, y), 
on the test surface is given by 

Equation: 


Normally the resulted fringe can be in the form of a linear fringe pattern by 
adjusting the relative position between the reference mirror and sample 
surfaces. Hence any distorted interference fringe would indicate a local 
profile/contour of the test surface. 


It is important to note that the Mireau objective is mounted on a capacitive 
closed-loop controlled PZT (piezoelectric actuator) as to enable phase 
shifting to be accurately implemented. The PZT is based on piezoelectric 
effect referred to the electric potential generated by applying pressure to 
piezoelectric material. This type of materials is used to convert electrical 
energy to mechanical energy and vice-versa. The precise motion that results 
when an electric potential is applied to a piezoelectric material has an 
importance for nanopositioning. Actuators using the piezo effect have been 
commercially available for 35 years and in that time have transformed the 
world of precision positioning and motion control. 


Vertical scanning interferometer also has another name; white-light 
interferometry (WLI) because of using the white light as a source of light. 
With this type of source a separate fringe system is produced for each 
wavelength, and the resultant intensity at any point of examined surface is 
obtained by summing these individual patterns. Due to the broad bandwidth 
of the source the coherent length L of the source is short: 

Equation: 


2 
L= 
nAr 


where A is the center wavelength, n is the refractive index of the medium, 
AA is the spectral width of the source. In this way good contrast fringes can 
be obtained only when the lengths of interfering beams pathways are closed 
to each other. If we will vary the length of a pathway of a beam reflected 
from sample, the height of a sample can be determined by looking at the 
position for which a fringe contrast is a maximum. In this case interference 


pattern exist only over a very shallow depth of the surface. When we vary a 
pathway of sample-reflected beam we also move the sample in a vertical 
direction in order to get the phase at which maximum intensity of fringes 
will be achieved. This phase will be converted in height of a point at the 
sample surface. 


The combination of phase shift technology with white-light source provides 
a very powerful tool to measure the topography of quite rough surfaces with 
the amplitude in heights about and the precision up to 1-2 nm. Through a 
developed software package for quantitatively evaluating the resulting 
interferogram, the proposed system can retrieve the surface profile and 
topography of the sample objects [link]. 


67.5 um 


Example of muscovite surface topography, obtained by using VSI- 50x 


objective. 


A comparison of common methods to determine surface 
topography: SEM, AFM and VSI 


Except the interferometric methods described above, there are a several 
other microscopic techniques for studying crystal surface topography. The 
most common are scanning electron microscopy (SEM) and atomic force 
microscopy (AFM). All these techniques are used to obtain information 
about the surface structure. However they differ from each other by the 
physical principles on which they based. 


Scanning electron microscopy 


SEM allows us to obtain images of surface topography with the resolution 
much higher than the conventional light microscopes do. Also it is able to 
provide information about other surface characteristics such as chemical 
composition, electrical conductivity etc, see [link]. All types of data are 
generated by the reflecting of accelerated electron beams from the sample 
surface. When electrons strike the sample surface, they lose their energy by 
repeated random scattering and adsorption within an outer layer into the 
depth varying from 100 nm to 5 microns. 


Incident beam 


X-rays 


Chemical composition of the surface Elastic backscattered electrons 


Atomic number and toporgaphy 


Cathodoluminescence Inelastic backscattered electrons 
Electrical information Surface toporgaphy 


NZ 


Scheme of electron beam-sample interaction at SEM analysis 


The thickness of this outer layer also knows as interactive layer depends on 
energy of electrons in the beam, composition and density of a sample. 
Result of the interaction between electron beam and the surface provides 
several types of signals. The main type is secondary or inelastic scattered 
electrons. They are produced as a result of interaction between the beam of 
electrons and weakly bound electrons in the conduction band of the sample. 
Secondary electrons are ejected from the k-orbitals of atoms within the 
surface layer of thickness about a few nanometers. This is because 
secondary electrons are low energy electrons (<50 eV), so only those 
formed within the first few nanometers of the sample surface have enough 
energy to escape and be detected. Secondary backscattered electrons 
provide the most common signal to investigate surface topography with 
lateral resolution up to 0.4 - 0.7 nm. 


High energy beam electrons are elastic scattered back from the surface. This 
type of signal gives information about chemical composition of the surface 
because the energy of backscattered electrons depends on the weight of 
atoms within the interaction layer. Also this type of electrons can form 
secondary electrons and escape from the surface or travel father into the 
sample than the secondary. The SEM image formed is the result of the 
intensity of the secondary electron emission from the sample at each x,y 
data point during the scanning of the surface. 


Atomic force microscopy 


AFM is a very popular tool to study surface dissolution. AFM set up 
consists of scanning a sharp tip on the end of a flexible cantilever which 
moves across a sample surface. The tips typically have an end radius of 2 to 
20 nm, depending on tip type. When the tip touch the surface the forces of 
these interactions leads to deflection of a cantilever. The interaction 
between tip and sample surface involve mechanical contact forces, van der 
Waals forces, capillary forces, chemical bonding, electrostatic forces, 
magnetic forces etc. The deflection of a cantilever is usually measured by 
reflecting a laser beam off the back of the cantilever into a split photodiode 
detector. A schematic drawing of AFM can be seen in [link]. The two most 
commonly used modes of operation are contact mode AFM and tapping 
mode AFM, which are conducted in air or liquid environments. 


Photodetector 
Laser beam 


Cantilever 


Schematic drawing of an AFM apparatus. 


Working under the contact mode AFM scans the sample while monitoring 
the change in cantilever deflection with the split photodiode detector. Loop 
maintains a constant cantilever reflection by vertically moving the scanner 


to get a constant signal. The distance which the scanner goes by moving 
vertically at each x,y data point is stored by the computer to form the 
topographic image of the sample surface. Working under the tapping mode 
AFM oscillates the cantilever at its resonance frequency (typically~300 
kHz) and lightly “taps” the tip on the surface during scanning. The 
electrostatic forces increase when tip gets close to the sample surface, 
therefore the amplitude of the oscillation decreases. The laser deflection 
method is used to detect the amplitude of cantilever oscillation. Similar to 
the contact mode, feedback loop maintains a constant oscillation amplitude 
by moving the scanner vertically at every x,y data point. Recording this 
movement forms the topographical image. The advantage of tapping mode 
over contact mode is that it eliminates the lateral, shear forces present in 
contact mode. This enables tapping mode to image soft, fragile, and 
adhesive surfaces without damaging them while work under contact mode 
allows the damage to occur. 


Comparison of techniques 


All techniques described above are widely used in studying of surface nano- 
and micromorphology. However, each method has its own limitations and 
the proper choice of analytical technique depends on features of analyzed 
surface and primary goals of research. 


All these techniques are capable to obtain an image of a sample surface 
with quite good resolution. The lateral resolution of VSI is much less, then 
for other techniques: 150 nm for VSI and 0.5 nm for AFM and SEM. 
Vertical resolution of AFM (0.5 A) is better then for VSI (1 - 2 nm), 
however VSI is capable to measure a high vertical range of heights (1 mm) 
which makes possible to study even very rough surfaces. On the contrary, 
AFM allows us to measure only quite smooth surfaces because of its 
relatively small vertical scan range (7 pm). SEM has less resolution, than 
AFM because it requires coating of a conductive material with the thickness 
within several nm. 


The significant advantage of VSI is that it can provide a large field of view 
(845 x 630 pm for 10x objective) of tested surfaces. Recent studies of 


surface roughness characteristics showed that the surface roughness 
parameters increase with the increasing field of view until a critical size of 
250,000 pm is reached. This value is larger then the maximum field of view 
produced by AFM (100 x 100 pm) but can be easily obtained by VSI. SEM 
is also capable to produce images with large field of view. However, SEM 
is able to provide only 2D images from one scan while AFM and VSI let us 
to obtain 3D images. It makes quantitative analysis of surface topography 
more complicated, for example, topography of membranes is studied by 
cross section and top view images. 


VSI AFM SEM 
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10 
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of a sample a conducted material 
Required Air Alr, . Vacuum 
environment liquid 


A comparison of VSI sample and resolution with AFM and SEM. 


The experimental studying of surface processes using 
microscopic techniques 


The limitations of each technique described above are critically important 
to choose appropriate technique for studying surface processes. Let’s 
explore application of these techniques to study dissolution of crystals. 


When crystalline matter dissolves the changes of the crystal surface 
topography can be observed by using microscopic techniques. If we will 
apply an unreactive mask (silicon for example) on crystal surface and place 
a crystalline sample into the experiment reactor then we get two types of 
surfaces: dissolving and remaining the same or unreacted. After some 
period of time the crystal surface starts to dissolve and change its z-level. In 
order to study these changes ex situ we can pull out a sample from the 
reaction cell then remove a mask and measure the average height difference 
Ahbetween the unreacted and dissolved areas. The average heights of 
dissolved and unreacted areas are obtained through digital processing of 
data obtained by microscopes. The velocity of normal surface retreat Usnr 
during the time interval At is defined as 


Ah 
USNR = At 


Dividing this velocity by the molar volume V(cm*/mol) gives a global 
dissolution rate in the familiar units of moles per unit area per unit time: 
Equation: 


USNR 
V 


Pe 


This method allows us to obtain experimental values of dissolution rates 
just by precise measuring of average surface heights. Moreover, using this 
method we can measure local dissolution rates at etch pits by monitoring 
changes in the volume and density of etch pits across the surface over time. 
VSI technique is capable to perform these measurements because of large 
vertical range of scanning. In order to get precise values of rates which are 


not depend on observing place of crystal surface we need to measure 
enough large areas. VSI technique provides data from areas which are large 
enough to study surfaces with heterogeneous dissolution dynamics and 
obtain average dissolution rates. Therefore, VSI makes possible to measure 
rates of normal surface retreat during the dissolution and observe formation, 
growth and distribution of etch pits on the surface. 


However, if the mechanism of dissolution is controlled by dynamics of 
atomic steps and kink sites within a smooth atomic surface area, the 
observation of the dissolution process need to use a more precise technique. 
AFM is capable to provide information about changes in step morphology 
in situ when the dissolution occurs. For example, immediate response of the 
dissolved surface to the changing of environmental conditions 
(concentrations of ions in the solution, pH etc.) can be studied by using 
AFM. 


SEM is also used to examine micro and nanotexture of solid surfaces and 
study dissolution processes. This method allows us to observe large areas of 
crystal surface with high resolution which makes possible to measure a high 
variety of surfaces. The significant disadvantage of this method is the 
requirement to cover examine sample by conductive substance which limits 
the resolution of SEM. The other disadvantage of SEM is that the analysis 
is conducted in vacuum. Recent technique, environmental SEM or ESEM 
overcomes these requirements and makes possible even examine liquids 
and biological materials. The third disadvantage of this technique is that it 
produces only 2D images. This creates some difficulties to measure Ah 
within the dissolving area. One of advantages of this technique is that it is 
able to measure not only surface topography but also chemical composition 
and other surface characteristics of the surface. This fact is used to monitor 
changing in chemical composition during the dissolution. 
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Atomic Force Microscopy 


Introduction 


Atomic force microscopy (AFM) is a high-resolution form of scanning 
probe microscopy, also known as scanning force microscopy (SFM). The 
instrument uses a cantilever with a sharp tip at the end to scan over the 
sample surface ({link]). As the probe scans over the sample surface, 
attractive or repulsive forces between the tip and sample, usually in the 
form of van der Waal forces but also can be a number of others such as 
electrostatic and hydrophobic/hydrophilic, cause a deflection of the 
cantilever. The deflection is measured by a laser ({link]) which is reflected 
off the cantilever into photodiodes. As one of the photodiodes collects more 
light, it creates an output signal that is processed and provides information 
about the vertical bending of the cantilever. This data is then sent to a 
scanner that controls the height of the probe as it moves across the surface. 
The variance in height applied by the scanner can then be used to produce a 
three-dimensional topographical representation of the sample. 


Cantilever 
with Tip 


=> 
Sample 


Simple schematic of 


atomic force microscope 
(AFM) apparatus. 
Adapted from H. G. 
Hansma, Department of 
Physics, University of 
California, Santa Barbara. 


Modes of operation 


Contact mode 


The contact mode method utilizes a constant force for tip-sample 
interactions by maintaining a constant tip deflection ([link].). The tip 
communicates the nature of the interactions that the probe is having at the 
surface via feedback loops and the scanner moves the entire probe in order 
to maintain the original deflection of the cantilever. The constant force is 
calculated and maintained by using Hooke's Law, [link]. This equation 
relates the force (F), spring constant (k), and cantilever deflection (x). Force 
constants typically range from 0.01 to 1.0 N/m. Contact mode usually has 
the fastest scanning times but can deform the sample surface. It is also only 
the only mode that can attain "atomic resolution." 

Equation: 


F = -kx 


Schematic diagram of 


probe and surface 
interaction in contact 
mode. 


Tapping mode 


In the tapping mode the cantilever is externally oscillated at its fundamental 
resonance frequency ({link]). A piezoelectric on top of the cantilever is used 
to adjust the amplitude of oscillation as the probe scans across the surface. 
The deviations in the oscillation frequency or amplitude due to interactions 
between the probe and surface are measured, and provide information about 
the surface or types of material present in the sample. This method is 
gentler than contact AFM since the tip is not dragged across the surface, but 
it does require longer scanning times. It also tends to provide higher lateral 
resolution than contact AFM. 


fia 
ee 


Diagram of probe and 
surface interaction in 
tapping mode. 


Noncontact mode 


For noncontact mode the cantilever is oscillated just above its resonance 
frequency and this frequency is decreased as the tip approaches the surface 
and experiences the forces associated with the material ((link]). The average 
tip-to-sample distance is measured as the oscillation frequency or amplitude 
is kept constant, which then can be used to image the surface. This method 
exerts very little force on the sample, which extends the lifetime of the tip. 
However, it usually does not provide very good resolution unless placed 
under a strong vacuum. 


Diagram of probe and 
surface interaction in 
noncontact mode. 


Experimental limitations 


A common problem seen in AFM images is the presence of artifacts which 
are distortions of the actual topography, usually either due to issues with the 
probe, scanner, or image processing. The AFM scans slowly which makes it 
more susceptible to external temperature fluctuations leading to thermal 
drift. This leads to artifacts and inaccurate distances between topographical 
features. 


It is also important to consider that the tip is not perfectly sharp and 
therefore may not provide the best aspect ratio, which leads to a 
convolution of the true topography. This leads to features appearing too 
large or too small since the width of the probe cannot precisely move 
around the particles and holes on the surface. It is for this reason that tips 


with smaller radii of curvature provide better resolution in imaging. The tip 
can also produce false images and poorly contrasted images if it is blunt or 
broken. 


The movement of particles on the surface due to the movement of the 
cantilever can cause noise, which forms streaks or bands in the image. 
Artifacts can also be made by the tip being of inadequate proportions 
compared to the surface being scanned. It is for this reason that it is 
important to use the ideal probe for the particular application. 


Sample size and preparation 


The sample size varies with the instrument but a typical size is 8 mm by 8 
mm with a typical height of 1 mm. Solid samples present a problem for 
AFM since the tip can shift the material as it scans the surface. Solutions or 
dispersions are best for applying as uniform of a layer of material as 
possible in order to get the most accurate value of particles’ heights. This is 
usually done by spin-coating the solution onto freshly cleaved mica which 
allows the particles to stick to the surface once it has dried. 


Applications of AFM 


AFM is particularly versatile in its applications since it can be used in 
ambient temperatures and many different environments. It can be used in 
many different areas to analyze different kinds of samples such as 
semiconductors, polymers, nanoparticles, biotechnology, and cells amongst 
others. The most common application of AFM is for morphological studies 
in order to attain an understanding of the topography of the sample. Since it 
is common for the material to be in solution, AFM can also give the user an 
idea of the ability of the material to be dispersed as well as the homogeneity 
of the particles within that dispersion. It also can provide a lot of 
information about the particles being studied such as particle size, surface 
area, electrical properties, and chemical composition. Certain tips are 
capable of determining the principal mechanical, magnetic, and electrical 
properties of the material. For example, in magnetic force microscopy 
(MFM) the probe has a magnetic coating that senses magnetic, electrostatic, 
and atomic interactions with the surface. This type of scanning can be 


performed in static or dynamic mode and depicts the magnetic structure of 
the surface. 


AFM of carbon nanotubes 


Atomic force microscopy is usually used to study the topographical 
morphology of these materials. By measuring the thickness of the material 
it is possible to determine if bundling occurred and to what degree. Other 
dimensions of the sample can also be measured such as the length and 
width of the tubes or bundles. It is also possible to detect impurities, 
functional groups ({link]), or remaining catalyst by studying the images. 
Various methods of producing nanotubes have been found and each 
demonstrates a slightly different profile of homogeneity and purity. These 
impurities can be carbon coated metal, amorphous carbon, or other 
allotropes of carbon such as fullerenes and graphite. These facts can be 
utilized to compare the purity and homogeneity of the samples made from 
different processes, as well as monitor these characteristics as different 
steps or reactions are performed on the material. The distance between the 
tip and the surface has proven itself to be an important parameter in 
noncontact mode AFM and has shown that if the tip is moved past the 
threshold distance, approximately 30 pm, it can move or damage the 
nanotubes. If this occurs, a useful characterization cannot be performed due 
to these distortions of the image. 


0 
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AFM image of a 
polyethyleneimine- 
functionalized single walled 
carbon nanotube (PEI-SWNT) 
showing the presence of PEI 
“globules” on the SWNT. 
Adapted from E. P. Dillon, C. 
A. Crouse, and A. R. Barron, 
ACS Nano, 2008, 2, 156. 


AFM of fullerenes 


Atomic force microscopy is best applied to aggregates of fullerenes rather 
than individual ones. While the AFM can accurately perform height 
analysis of individual fullerene molecules, it has poor lateral resolution and 
it is difficult to accurately depict the width of an individual molecule. 
Another common issue that arises with contact AFM and fullerene 
deposited films is that the tip shifts clusters of fullerenes which can lead to 
discontinuities in sample images. 
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Introduction to Nanoparticle Synthesis 


The fabrication of nanomaterials with strict control over size, shape, and 
crystalline structure has inspired the application of nanochemistry to 
numerous fields including catalysis, medicine, and electronics. The use of 
nanomaterials in such applications also requires the development of 
methods for nanoparticle assembly or dispersion in various media. A 
majority of studies have been aimed at dispersion in aqueous media aimed 
at their use in medical applications and studies of environmental effects, 
however, the principles of nanoparticle fabrication and functionalization of 
nanoparticles transcends their eventual application. Herein, we review the 
most general routes to nanoparticles of the key types that may have 
particular application within the oil and gas industry for sensor, composite, 
or device applications. 


Synthesis methods for nanoparticles are typically grouped into two 
categories: “top-down” and “bottom-up”. The first involves division of a 
massive solid into smaller portions. This approach may involve milling or 
attrition, chemical methods, and volatilization of a solid followed by 
condensation of the volatilized components. The second, “bottom-up”, 
method of nanoparticle fabrication involves condensation of atoms or 
molecular entities in a gas phase or in solution. The latter approach is far 
more popular in the synthesis of nanoparticles. 


Dispersions of nanoparticles are intrinsically thermodynamically 
metastable, primarily due to their very high surface area, which represents a 
positive contribution to the free enthalpy of the system. If the activation 
energies are not sufficiently high, evolution of the nanoparticle dispersion 
occurs causing an increase in nanoparticle size as typified by an Ostwald 
ripening process. Thus, highly dispersed nanoparticles are only kinetically 
stabilized and cannot be prepared under conditions that exceed some 
threshold, meaning that so-called “soft-chemical” or “chemie duce” 
methods are preferred. In addition, the use of surface stabilization is 
employed in many nanomaterials to hinder sintering, recrystallization and 
aggregation. 
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Synthesis of Semiconductor Nanoparticles 


The most studied non-oxide semiconductors are cadmium chalcogenides 
(CdE, with E = sulfide, selenide and telluride). CdE nanocrystals were 
probably the first material used to demonstrate quantum size effects 
corresponding to a change in the electronic structure with size, i.e., the 
increase of the band gap energy with the decrease in size of particles 
({link]). These semiconductors nanocrystals are commonly synthesized by 
thermal decomposition of an organometallic precursor dissolved in an 
anhydrous solvent containing the source of chalcogenide and a stabilizing 
material (polymer or capping ligand). Stabilizing molecules bound to the 
surface of particles control their growth and prevent particle aggregation. 


Picture of cadmium selenide 
(CdSe) quantum dots, dissolved 
in toluene, fluorescing brightly, 

as they are exposed to an 
ultraviolet lamp, in three 
noticeable different colors (blue 
~481 nm, green ~520 nm, and 
orange ~612 nm) due to the 
quantum dots' bandgap (and 
thus the wavelength of emitted 
light) depends strongly on the 
particle size; the smaller the dot, 


the shorter the emitted 
wavelength of light. The "blue" 
quantum dots have the smallest 
particle size, the "green" dots 
are slightly larger, and the 
"orange" dots are the largest. 


Although cadmium chalcogenides are the most studies semiconducting 
nanoparticles, the methodology for the formation of semiconducting 
nanoparticles was first demonstrated independently for InP and GaAs, e.g., 
[link]. This method has been adapted for a range of semiconductor 
nanoparticles. 

Equation: 


InCl, + P(SiMe;); > InP + 3 Me,SiCl 


In the case of CdE, dimethylcadmium Cd(CHs3), is used as a cadmium 
source and bis(trimethylsilyl)sulfide, (Me3Si)2S, trioctylphosphine selenide 
or telluride (TOPSe, TOPTe) serve as sources of selenide in 
trioctylphosphine oxide (TOPO) used as solvent and capping molecule. The 
mixture is heated at 230-260 °C over a few hours while modulating the 
temperature in response to changes in the size distribution as estimated 
from the absorption spectra of aliquots removed at regular intervals. These 
particles, capped with TOP/TOPO molecules, are non-aggregated ([link]) 
and easily dispersible in organic solvents forming optically clear 
dispersions. When similar syntheses are performed in the presence of 
surfactant, strongly anisotropic nanoparticles are obtained, e.g., rod-shaped 
CdSe nanoparticles can be obtained. 


TEM image of CdSe 
nanoparticles. 


Because Cd(CH3)> is extremely toxic, pyrophoric and explosive at elevated 
temperature, other Cd sources have been used. CdO appears to be an 
interesting precursor. CdO powder dissolves in TOPO and HPA or TDPA 
(tetradecylphosphonic acid) at about 300 °C giving a colorless 
homogeneous solution. By introducing selenium or tellurium dissolved in 
TOP, nanocrystals grow to the desired size. 


Nanorods of CdSe or CdTe can also be produced by using a greater initial 
concentration of cadmium as compared to reactions for nanoparticles. This 
approach has been successfully applied for synthesis of numerous other 
metal chalcogenides including ZnS, ZnSe, and Zn,_,Cd,S. Similar 
procedures enable the formation of MnS, PdS, NiS, Cu»S nanoparticles, 
nano rods, and nano disks. 
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Optical Properties of Group 12-16 (II-VI) Semiconductor Nanoparticles 


What are Group 12-16 semiconductors? 


Semiconductor materials are generally classified on the basis of the periodic 
table group that their constituent elements belong to. Thus, Group 12-16 
semiconductors, formerly called II-VI semiconductors, are materials whose 
cations are from the Group 12 and anions are from Group 16 in the periodic 
table ([link]). Some examples of Group 12-16 semiconductor materials are 
cadmium selenide (CdSe), zinc sulfide (ZnS), cadmium teluride (CdTe), 
zinc oxide (ZnO), and mercuric selenide (HgSe) among others. 


Note:The new IUPAC (International Union of Pure and Applied 
Chemistry) convention is being followed in this document, to avoid any 
confusion with regard to conventions used earlier. In the old IUPAC 
convention, Group 12 was known as Group IIB with the roman numeral 
‘II’ referring to the number of electrons in the outer electronic shells and B 
referring to being on the right part of the table. However, in the CAS 
(Chemical Abstracts Service), the alphabet B refers to transition elements 
as compared to main group elements, though the roman numeral has the 
same meaning. Similarly, Group 16 was earlier known as Group VI 
because all the elements in this group have 6 valence shell electrons. 


The red box 
indicates the Group 
12 and Group 16 
elements in the 
periodic table. 


What are Group 12-16 (II-VI) semiconductor nanoparticles? 


From the Greek word nanos - meaning "dwarf" this prefix is used in the 
metric system to mean 10°° or one billionth (1/1,000,000,000). Thus a 
nanometer is 10°? or one billionth of a meter, and a nanojoule is 10°? or one 
billionth of a Joule, etc. A nanoparticle is ordinarily defined as any particle 
with at least one of its dimensions in the 1 - 100 nm range. 


Nanoscale materials often show behavior which is intermediate between 
that of a bulk solid and that of an individual molecule or atom. An inorganic 
nanocrystal can be imagined to be comprised of a few atoms or molecules. 
It thus will behave differently from a single atom; however, it is still smaller 
than a macroscopic solid, and hence will show different properties. For 
example, if one would compare the chemical reactivity of a bulk solid anda 
nanoparticle, the latter would have a higher reactivity due to a significant 
fraction of the total number of atoms being on the surface of the particle. 
Properties such as boiling point, melting point, optical properties, chemical 
stability, electronic properties, etc. are all different in a nanoparticle as 
compared to its bulk counterpart. In the case of Group 12-16 
semiconductors, this reduction in size from bulk to the nanoscale results in 
many size dependent properties such as varying band gap energy, optical 
and electronic properties. 


Optical properties of semiconductor quantum nanoparticles 


In the case of semiconductor nanocrystals, the effect of the size on the 
optical properties of the particles is very interesting. Consider a Group 12- 
16 semiconductor, cadmium selenide (CdSe). A 2 nm sized CdSe crystal 


has a blue color fluorescence whereas a larger nanocrystal of CdSe of about 
6 nm has a dark red fluorescence ([link]). In order to understand the size 
dependent optical properties of semiconductor nanoparticles, it is important 
to know the physics behind what is happening at the nano level. 


Fluorescing CdSe 
quantum dots synthesized 
in a heat transfer liquid of 

different sizes (M. S. 
Wong, Rice University). 


Energy levels in a semiconductor 


The electronic structure of any material is given by a solution of 
Schrédinger equations with boundary conditions, depending on the physical 
situation. The electronic structure of a semiconductor ({link]) can be 
described by the following terms: 


Conduction band 


Band Gap =| Eg (bulk) 


Valence band 


Simplified 
representation 
of the energy 
levels ina 
bulk 
semiconductor 


Energy level 


By the solution of Schrédinger’s equations, the electrons in a 
semiconductor can have only certain allowable energies, which are 
associated with energy levels. No electrons can exist in between these 
levels, or in other words can have energies in between the allowed energies. 
In addition, from Pauli’s Exclusion Principle, only 2 electrons with opposite 
spin can exist at any one energy level. Thus, the electrons start filling from 
the lowest energy levels. Greater the number of atoms in a crystal, the 
difference in allowable energies become very small, thus the distance 
between energy levels decreases. However, this distance can never be zero. 
For a bulk semiconductor, due to the large number of atoms, the distance 


between energy levels is very small and for all practical purpose the energy 
levels can be described as continuous ([link]). 


Band gap 


From the solution of Schrédinger’s equations, there are a set of energies 
which is not allowable, and thus no energy levels can exist in this region. 
This region is called the band gap and is a quantum mechanical 
phenomenon ({link]). In a bulk semiconductor the bandgap is fixed; 
whereas in a quantum dot nanoparticle the bandgap varies with the size of 
the nanoparticle. 


Valence band 


In bulk semiconductors, since the energy levels can be considered as 
continuous, they are also termed as energy bands. Valence band contains 
electrons from the lowest energy level to the energy level at the lower edge 
of the bandgap ([link]). Since filling of energy is from the lowest energy 
level, this band is usually almost full. 


Conduction band 


The conduction band consists of energy levels from the upper edge of the 
bandgap and higher ([link]). To reach the conduction band, the electrons in 
the valence band should have enough energy to cross the band gap. Once 
the electrons are excited, they subsequently relax back to the valence band 
(either radiatively or non-radiatively) followed by a subsequent emission of 
radiation. This property is responsible for most of the applications of 
quantum dots. 


Exciton and exciton Bohr radius 


When an electron is excited from the valence band to the conduction band, 
corresponding to the electron in the conduction band a hole (absence of 
electron) is formed in the valence band. This electron pair is called an 
exciton. Excitons have a natural separation distance between the electron 
and hole, which is characteristic of the material. This average distance is 
called exciton Bohr radius. In a bulk semiconductor, the size of the crystal 
is much larger than the exciton Bohr radius and hence the exciton is free to 
move throughout the crystal. 


Energy levels in a quantum dot semiconductor 


Before understanding the electronic structure of a quantum dot 
semiconductor, it is important to understand what a quantum dot 
nanoparticle is. We earlier studied that a nanoparticle is any particle with 
one of its dimensions in the 1 - 100 nm. A quantum dot is a nanoparticle 
with its diameter on the order of the materials exciton Bohr radius. 
Quantum dots are typically 2 - 10 nm wide and approximately consist of 10 
to 50 atoms. With this understanding of a quantum dot semiconductor, the 
electronic structure of a quantum dot semiconductor can be described by the 
following terms. 
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Quantum confinement 


When the size of the semiconductor crystal becomes comparable or smaller 
than the exciton Bohr radius, the quantum dots are in a state of quantum 
confinement. As a result of quantum confinement, the energy levels in a 
quantum dot are discrete ([link]) as opposed to being continuous in a bulk 
crystal ([link]). 


Discrete energy levels 


In materials that have small number of atoms and are considered as 
quantum confined, the energy levels are separated by an appreciable 
amount of energy such that they are not continuous, but are discrete (see 
[link]). The energy associated with an electron (equivalent to conduction 
band energy level) is given by is given by [link], where h is the Planck’s 
constant, mz is the effective mass of electron and n is the quantum number 
for the conduction band states, and n can take the values 1, 2, 3 and so on. 
Similarly, the energy associated with the hole (equivalent to valence band 
energy level) is given by [link], where n' is the quantum number for the 


valence states, and n’ can take the values 1, 2, 3, and so on. The energy 
increases as one goes higher in the quantum number. Since the electron 
mass is much smaller than that of the hole, the electron levels are separated 
more widely than the hole levels. 
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Tunable band gap 


As seen from [link] and [link], the energy levels are affected by the 
diameter of the semiconductor particles. If the diameter is very small, since 
the energy is dependent on inverse of diameter squared, the energy levels of 
the upper edge of the band gap (lowest conduction band level) and lower 
edge of the band gap (highest valence band level) change significantly with 
the diameter of the particle and the effective mass of the electron and the 
hole, resulting in a size dependent tunable band gap. This also results in the 
discretization of the energy levels. 


Qualitatively, this can be understood in the following way. In a bulk 
semiconductor, the addition or removal of an atom is insignificant 
compared to the size of the bulk semiconductor, which consists of a large 
number of atoms. The large size of bulk semiconductors makes the changes 
in band gap so negligible on the addition of an atom, that it is considered as 
a fixed band gap. In a quantum dot, addition of an atom does make a 
difference, resulting in the tunability of band gap. 


UV-visible absorbance 


Due to the presence of discrete energy levels in a QD, there is a widening of 
the energy gap between the highest occupied electronic states and the 
lowest unoccupied states as compared to the bulk material. As a 
consequence, the optical properties of the semiconductor nanoparticles also 
become size dependent. 


The minimum energy required to create an exciton is the defined by the 
band gap of the material, i.e., the energy required to excite an electron from 
the highest level of valence energy states to the lowest level of the 
conduction energy states. For a quantum dot, the bandgap varies with the 
size of the particle. From [link] and [link], it can be inferred that the band 
gap becomes higher as the particle becomes smaller. This means that for a 
smaller particle, the energy required for an electron to get excited is higher. 
The relation between energy and wavelength is given by [link], where h is 
the Planck’s constant, c is the speed of light, A is the wavelength of light. 
Therefore, from [link] to cross a bandgap of greater energy, shorter 
wavelengths of light are absorbed, i.e., a blue shift is seen. 

Equation: 


B= he 
ny 


For Group 12-16 semiconductors, the bandgap energy falls in the UV- 
visible range. That is ultraviolet light or visible light can be used to excite 
an electron from the ground valence states to the excited conduction states. 
In a bulk semiconductor the band gap is fixed, and the energy states are 
continuous. This results in a rather uniform absorption spectrum ((link]a). 
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UV-vis spectra of (a) bulk CdS and (b) 4 nm 
CdS. Adapted from G. Kickelbick, Hybrid 
Materials: Synthesis, Characterization and 

Applications, Wiley-VCH, Weinheim 
(2007). 


In the case of Group 12-16 quantum dots, since the bandgap can be changed 
with the size, these materials can absorb over a range of wavelengths. The 
peaks seen in the absorption spectrum ({link]b) correspond to the optical 
transitions between the electron and hole levels. The minimum energy and 
thus the maximum wavelength peak corresponds to the first exciton peak or 
the energy for an electron to get excited from the highest valence state to 
the lowest conduction state. The quantum dot will not absorb wavelengths 
of energy longer than this wavelength. This is known as the absorption 
onset. 


Fluorescence 


Fluorescence is the emission of electromagnetic radiation in the form of 
light by a material that has absorbed a photon. When a semiconductor 
quantum dot (QD) absorbs a photon/energy equal to or greater than its band 
gap, the electrons in the QD’s get excited to the conduction state. This 
excited state is however not stable. The electron can relax back to its 
ground state by either emitting a photon or lose energy via heat losses. 
These processes can be divided into two categories — radiative decay and 


non-radiative decay. Radiative decay is the loss of energy through the 
emission of a photon or radiation. Non-radiative decay involves the loss of 
heat through lattice vibrations and this usually occurs when the energy 
difference between the levels is small. Non-radiative decay occurs much 
faster than radiative decay. 


Usually the electron relaxes to the ground state through a combination of 
both radiative and non-radiative decays. The electron moves quickly 
through the conduction energy levels through small non-radiative decays 
and the final transition across the band gap is via a radiative decay. Large 
nonradiative decays don’t occur across the band gap because the crystal 
structure can’t withstand large vibrations without breaking the bonds of the 
crystal. Since some of the energy is lost through the non-radiative decay, the 
energy of the emitted photon, through the radiative decay, is much lesser 
than the absorbed energy. As a result the wavelength of the emitted photon 
or fluorescence is longer than the wavelength of absorbed light. This energy 
difference is called the Stokes shift. Due this Stokes shift, the emission peak 
corresponding to the absorption band edge peak is shifted towards a higher 
wavelength (lower energy), i.e., [link]. 
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Absorption spectra (a) 
and emission spectra (b) 
of CdSe tetrapod. 


Intensity of emission versus wavelength is a bell-shaped Gaussian curve. As 
long as the excitation wavelength is shorter than the absorption onset, the 
maximum emission wavelength is independent of the excitation 
wavelength. [link] shows a combined absorption and emission spectrum for 
a typical CdSe tetrapod. 


Factors affecting the optical properties of NPs 


There are various factors that affect the absorption and emission spectra for 
Group 12-16 semiconductor quantum crystals. Fluorescence is much more 
sensitive to the background, environment, presence of traps and the surface 
of the QDs than UV-visible absorption. Some of the major factors 
influencing the optical properties of quantum nanoparticles include: 


e Surface defects, imperfection of lattice, surface charges — The 
surface defects and imperfections in the lattice structure of 
semiconductor quantum dots occur in the form of unsatisfied 
valencies. Similar to surface charges, unsatisfied valencies provide a 
sink for the charge carriers, resulting in unwanted recombinations. 

¢ Surface ligands — The presence of surface ligands is another factor 
that affects the optical properties. If the surface ligand coverage is a 
100%, there is a smaller chance of surface recombinations to occur. 

¢ Solvent polarity — The polarity of solvents is very important for the 
optical properties of the nanoparticles. If the quantum dots are 
prepared in organic solvent and have an organic surface ligand, the 
more non-polar the solvent, the particles are more dispersed. This 
reduces the loss of electrons through recombinations again, since when 
particles come in close proximity to each other, increases the non- 
radiative decay events. 


Applications of the optical properties of Group 12-16 semiconductor 
NPs 


The size dependent optical properties of NP’s have many applications from 
biomedical applications to solar cell technology, from photocatalysis to 
chemical sensing. Most of these applications use the following unique 
properties. 


For applications in the field of nanoelectronics, the sizes of the quantum 
dots can be tuned to be comparable to the scattering lengths, reducing the 
scattering rate and hence, the signal to noise ratio. For Group 12-16 QDs to 
be used in the field of solar cells, the bandgap of the particles can be tuned 
so as to form absorb energy over a large range of the solar spectrum, 
resulting in more number of excitons and hence more electricity. Since the 
nanoparticles are so small, most of the atoms are on the surface. Thus, the 
surface to volume ratio is very large for the quantum dots. In addition to a 
high surface to volume ratio, the Group 12-16 QDs respond to light energy. 
Thus quantum dots have very good photocatalytic properties. Quantum dots 
show fluorescence properties, and emit visible light when excited. This 
property can be used for applications as biomarkers. These quantum dots 
can be tagged to drugs to monitor the path of the drugs. Specially shaped 
Group 12-16 nanoparticles such as hollow shells can be used as drug 
delivery agents. Another use for the fluorescence properties of Group 12-16 
semiconductor QDs is in color-changing paints, which can change colors 
according to the light source used. 
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Characterization of Group 12-16 (II-VI) Semiconductor Nanoparticles by 
UV-visible Spectroscopy 


Quantum dots (QDs) as a general term refer to nanocrystals of 
semiconductor materials, in which the size of the particles are comparable 
to the natural characteristic separation of an electron-hole pair, otherwise 
known as the exciton Bohr radius of the material. When the size of the 
semiconductor nanocrystal becomes this small, the electronic structure of 
the crystal is governed by the laws of quantum physics. Very small Group 
12-16 (II-VI) semiconductor nanoparticle quantum dots, in the order of 2 - 
10 nm, exhibit significantly different optical and electronic properties from 
their bulk counterparts. The characterization of size dependent optical 
properties of Group 12-16 semiconductor particles provide a lot of 
qualitative and quantitative information about them — size, quantum yield, 
monodispersity, shape and presence of surface defects. A combination of 
information from both the UV-visible absorption and fluorescence, 
complete the analysis of the optical properties. 


UV-visible absorbance spectroscopy 


Absorption spectroscopy, in general, refers to characterization techniques 
that measure the absorption of radiation by a material, as a function of the 
wavelength. Depending on the source of light used, absorption spectroscopy 
can be broadly divided into infrared and UV-visible spectroscopy. The band 
gap of Group 12-16 semiconductors is in the UV-visible region. This means 
the minimum energy required to excite an electron from the valence states 
of the Group 12-16 semiconductor QDs to its conduction states, lies in the 
UV-visible region. This is also a reason why most of the Group 12-16 
semiconductor quantum dot solutions are colored. 


This technique is complementary to fluorescence spectroscopy, in that UV- 
visible spectroscopy measures electronic transitions from the ground state 
to the excited state, whereas fluorescence deals with the transitions from the 
excited state to the ground state. In order to characterize the optical 
properties of a quantum dot, it is important to characterize the sample with 
both these techniques 


In quantum dots, due to the very small number of atoms, the addition or 
removal of one atom to the molecule changes the electronic structure of the 
quantum dot dramatically. Taking advantage of this property in Group 12- 
16 semiconductor quantum dots, it is possible to change the band gap of the 
material by just changing the size of the quantum dot. A quantum dot can 
absorb energy in the form of light over a range of wavelengths, to excite an 
electron from the ground state to its excited state. The minimum energy that 
is required to excite an electron, is dependent on the band gap of the 
quantum dot. Thus, by making accurate measurements of light absorption at 
different wavelengths in the ultraviolet and visible spectrum, a correlation 
can be made between the band gap and size of the quantum dot. Group 12- 
16 semiconductor quantum dots are of particular interest, since their band 
gap lies in the visible region of the solar spectrum. 


The UV-visible absorbance spectroscopy is a characterization technique in 
which the absorbance of the material is studied as a function of wavelength. 
The visible region of the spectrum is in the wavelength range of 380 nm 
(violet) to 740 nm (red) and the near ultraviolet region extends to 
wavelengths of about 200 nm. The UV-visible spectrophotometer analyzes 
over the wavelength range 200 — 900 nm. 


When the Group 12-16 semiconductor nanocrystals are exposed to light 
having an energy that matches a possible electronic transition as dictated by 
laws of quantum physics, the light is absorbed and an exciton pair is 
formed. The UV-visible spectrophotometer records the wavelength at which 
the absorption occurs along with the intensity of the absorption at each 
wavelength. This is recorded in a graph of absorbance of the nanocrystal 
versus wavelength. 


Instrumentation 


A working schematic of the UV-visible spectrophotometer is shown in 
[link]. 
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Schematic of UV-visible spectrophotometer. 


The light source 


Since it is a UV-vis spectrophotometer, the light source ([link]) needs to 
cover the entire visible and the near ultra-violet region (200 - 900 nm). 
Since it is not possible to get this range of wavelengths from a single lamp, 
a combination of a deuterium lamp for the UV region of the spectrum and 
tungsten or halogen lamp for the visible region is used. This output is then 
sent through a diffraction grating as shown in the schematic. 


The diffraction grating and the slit 


The beam of light from the visible and/or UV light source is then separated 

into its component wavelengths (like a very efficient prism) by a diffraction 
grating ([link]). Following the slit is a slit that sends a monochromatic beam 
into the next section of the spectrophotometer. 


Rotating discs 


Light from the slit then falls onto a rotating disc ([link]). Each disc consists 
of different segments — an opaque black section, a transparent section and a 
mirrored section. If the light hits the transparent section, it will go straight 
through the sample cell, get reflected by a mirror, hits the mirrored section 
of a second rotating disc, and then collected by the detector. Else if the light 
hits the mirrored section, gets reflected by a mirror, passes through the 
reference cell, hits the transparent section of a second rotating disc and then 
collected by the detector. Finally if the light hits the black opaque section, it 
is blocked and no light passes through the instrument, thus enabling the 
system to make corrections for any current generated by the detector in the 
absence of light. 


Sample cell, reference cell and sample preparation 


For liquid samples, a square cross section tube sealed at one end is used. 
The choice of cuvette depends on the following factors: 


¢ Type of solvent - For aqueous samples, specially designed rectangular 
quartz, glass or plastic cuvettes are used. For organic samples glass 
and quartz cuvettes are used. 

¢ Excitation wavelength — Depending on the size and thus, bandgap of 
the 12-16 semiconductor nanoparticles, different excitation 
wavelengths of light are used. Depending on the excitation 
wavelength, different materials are used 


Cuvette Wavelength (nm) 


Visible only glass 380 - 780 


Visible only plastic 380 - 780 
UV plastic 220 - 780 
Quartz 200 - 900 


Cuvette materials and their wavelengths. 


e Cost — Plastic cuvettes are the least expensive and can be discarded 
after use. Though quartz cuvettes have the maximum utility, they are 
the most expensive, and need to reused. Generally, disposable plastic 
cuvettes are used when speed is more important than high accuracy. 


The best cuvettes need to be very clear and have no impurities that might 
affect the spectroscopic reading. Defects on the cuvette such as scratches, 
can scatter light and hence should be avoided. Some cuvettes are clear only 
on two sides, and can be used in the UV-Visible spectrophotometer, but 
cannot be used for fluorescence spectroscopy measurements. For Group 12- 
16 semiconductor nanoparticles prepared in organic solvents, the quartz 
cuvette is chosen. 


In the sample cell the quantum dots are dispersed in a solvent, whereas in 
the reference cell the pure solvent is taken. It is important that the sample be 
very dilute (maximum first exciton absorbance should not exceed 1 au) and 
the solvent is not UV-visible active. For these measurements, it is required 
that the solvent does not have characteristic absorption or emission in the 
region of interest. Solution phase experiments are preferred, though it is 
possible to measure the spectra in the solid state also using thin films, 
powders, etc. The instrumentation for solid state UV-visible absorption 
spectroscopy is slightly different from the solution phase experiments and is 
beyond the scope of discussion. 


Detector 


Detector converts the light into a current signal that is read by a computer. 
Higher the current signal, greater is the intensity of the light. The computer 


then calculates the absorbance using the in [link], where A denotes 
absorbance, I is sample cell intensity and I, is the reference cell intensity. 
Equation: 


A = logio(ly/D 


The following cases are possible: 


Where I < Ip and A < 0. This usually occurs when the solvent absorbs 
in the wavelength range. Preferably the solvent should be changed, to 
get an accurate reading for actual reference cell intensity. 

Where I = Ip and A= 0. This occurs when pure solvent is put in both 
reference and sample cells. This test should always be done before 
testing the sample, to check for the cleanliness of the cuvettes. 

When A = 1. This occurs when 90% or the light at a particular 
wavelength has been absorbed, which means that only 10% is seen at 
the detector. So Ip/I becomes 100/10 = 10. Logo of 10 is 1. 

When A > 1. This occurs in extreme case where more than 90% of the 
light is absorbed. 


Output 


The output is the form of a plot of absorbance against wavelength, e.g., 
[link]. 
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Beer-Lambert law 


In order to make comparisons between different samples, it is important that 
all the factors affecting absorbance should be constant except the sample 
itself. 


Effect of concentration on absorbance 


The extent of absorption depends on the number of absorbing nanoparticles 
or in other words the concentration of the sample. If it is a reasonably 
concentrated solution, it will have a high absorbance since there are lots of 
nanoparticles to interact with the light. Similarly in an extremely dilute 
solution, the absorbance is very low. In order to compare two solutions, it is 
important that we should make some allowance for the concentration. 


Effect of container shape 


Even if we had the same concentration of solutions, if we compare two 
solutions — one in a rectagular shaped container (e.g., [link]) so that light 
travelled 1 cm through it and the other in which the light travelled 100 cm 
through it, the absorbance would be different. This is because if the length 
the light travelled is greater, it means that the light interacted with more 
number of nanocrystals, and thus has a higher absorbance. Again, in order 
to compare two solutions, it is important that we should make some 
allowance for the concentration. 


icm 


SA 
| 


| 


A typical 
rectangular 
cuvette for 
UV-visible 

spectroscopy 


The law 


The Beer-Lambert law addresses the effect of concentration and container 
shape as shown in [link], [link] and [link], where A denotes absorbance; € is 
the molar absorptivity or molar absorption coefficient; | is the path length of 
light (in cm); and c is the concentration of the solution (mol/dm?). 
Equation: 


logio(p/D = ele 


Equation: 


A = ésle 


Molar absorptivity 


From the Beer-Lambert law, the molar absorptivity 'e' can be expressed as 
shown in [link]. 
Equation: 


c = Alle 


Molar absorptivity corrects for the variation in concentration and length of 
the solution that the light passes through. It is the value of absorbance when 
light passes through 1 cm of a 1 mol/dm? solution. 


Limitations of Beer-Lambert law 


The linearity of the Beer-Lambert law is limited by chemical and 
instrumental factors. 


e At high concentrations (> 0.01 M), the relation between absorptivity 
coefficient and absorbance is no longer linear. This is due to the 
electrostatic interactions between the quantum dots in close proximity. 

e If the concentration of the solution is high, another effect that is seen is 
the scattering of light from the large number of quantum dots. 

e The spectrophotometer performs calculations assuming that the 
refractive index of the solvent does not change significantly with the 
presence of the quantum dots. This assumption only works at low 
concentrations of the analyte (quantum dots). 


e Presence of stray light. 


Analysis of data 


The data obtained from the spectrophotometer is a plot of absorbance as a 
function of wavelength. Quantitative and qualitative data can be obtained 
by analysing this information 


Quantitative Information 


The band gap of the semiconductor quantum dots can be tuned with the size 
of the particles. The minimum energy for an electron to get excited from the 
ground state is the energy to cross the band gap. In an absorption spectra, 
this is given by the first exciton peak at the maximum wavelength (Ajax). 


Size of the quantum dots 


The size of quantum dots can be approximated corresponding to the first 
exciton peak wavelength. Emperical relationships have been determined 
relating the diameter of the quantum dot to the wavelength of the first 
exciton peak. The Group 12-16 semiconductor quantum dots that they 
studied were cadmium selenide (CdSe), cadmium telluride (CdTe) and 
cadmium sulfide (CdS). The empirical relationships are determined by 
fitting experimental data of absorbance versus wavelength of known sizes 
of particles. The empirical equations determined are given for CdTe, CdSe, 
and CdS in [link], [link] and [link] respectively, where D is the diameter 
and A is the wavelength corresponding to the first exciton peak. For 
example, if the first exciton peak of a CdSe quantum dot is 500 nm, the 
corresponding diameter of the quantum dot is 2.345 nm and for a 
wavelength of 609 nm, the corresponding diameter is 5.008 nm. 
Equation: 


D = (9.8127 x 10°7)A3 - (1.7147 x 10°3)A2 + (1.0064)d - 194.84 


Equation: 


D = (1.6122 x 10°7)A3 - (2.6575 x 10°)A2 + (1.6242 x 10°3)A + 41.57 


Equation: 


D = (-6.6521 x 10°8)h3 + (1.9577 x 10-)22 - (9.2352 x 102)h + 13.29 


Concentration of sample 


Using the Beer-Lambert law, it is possible to calculate the concentration of 
the sample if the molar absorptivity for the sample is known. The molar 
absorptivity can be calculated by recording the absorbance of a standard 
solution of 1 mol/dm? concentration in a standard cuvette where the light 
travels a constant distance of 1 cm. Once the molar absorptivity and the 
absorbance of the sample are known, with the length the light travels being 
fixed, it is possible to determine the concentration of the sample solution. 


Empirical equations can be determined by fitting experimental data of 
extinction coefficient per mole of Group 12-16 semiconductor quantum 
dots, at 250 °C, to the diameter of the quantum dot, [link], [link], and [link]. 
Equation: 


é = 10043 x D?? 
Equation: 

€ = 5857 x D?*® 
Equation: 

é€ = 21536 x D*3 


The concentration of the quantum dots can then be then be determined by 
using the Beer Lambert law as given by [link]. 


Qualitative Information 


Apart from quantitative data such as the size of the quantum dots and 
concentration of the quantum dots, a lot of qualitative information can be 
derived from the absorption spectra. 


Size distribution 


If there is a very narrow size distribution, the first exciton peak will be very 
sharp ({link]). This is because due to the narrow size distribution, the 
differences in band gap between different sized particles will be very small 
and hence most of the electrons will get excited over a smaller range of 
wavelengths. In addition, if there is a narrow size distribution, the higher 
exciton peaks are also seen clearly. 
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Narrow emission spectra (a) and broad 
emission spectra (b) of CdSe QDs. 


Shaped particles 


In the case of a spherical quantum dot, in all dimensions, the particle is 
quantum confined ([link]). In the case of a nanorod, whose length is not in 
the quantum regime, the quantum effects are determined by the width of the 


nanorod. Similar is the case in tetrapods or four legged structures. The 
quantum effects are determined by the thickness of the arms. During the 
synthesis of the shaped particles, the thickness of the rod or the arm of the 
tetrapod does not vary among the different particles, as much as the length 
of the rods or arms changes. Since the thickness of the rod or tetrapod is 
responsible for the quantum effects, the absorption spectrum of rods and 
tetrapods has sharper features as compared to a quantum dot. Hence, 
qualitatively it is possible to differentiate between quantum dots and other 
shaped particles. 


Dot Rod Tetrapod 


Different shaped nanoparticles 
with the arrows indicating the 
dimension where quantum 
confinement effects are 
observed. 


Crystal lattice information 


In the case of CdSe semiconductor quantum dots it has been shown that it is 
possible to estimate the crystal lattice of the quantum dot from the 
adsorption spectrum ([link]), and hence determine if the structure is zinc 
blend or wurtzite. 
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Zinc blende and wurtzite CdSe 
absorption spectra. Adapted 
from J. Jasieniak, C. Bullen, J. 
van Embden, and P. Mulvaney, 
J. Phys. Chem. B, 2005, 109, 
20665. 


UV-vis absorption spectra of Group 12-16 semiconductor 
nanoparticles 


Cadmium selenide 


Cadmium selenide (CdSe) is one of the most popular Group 12-16 
semiconductors. This is mainly because the band gap (712 nm or 1.74 eV) 
energy of CdSe. Thus, the nanoparticles of CdSe can be engineered to have 
a range of band gaps throughout the visible range, corresponding to the 
major part of the energy that comes from the solar spectrum. This property 
of CdSe along with its fluorescing properties is used in a variety of 
applications such as solar cells and light emitting diodes. Though cadmium 
and selenium are known carcinogens, the harmful biological effects of 
CdSe can be overcome by coating the CdSe with a layer of zinc sulfide. 
Thus CdSe, can also be used as bio-markers, drug-delivery agents, paints 
and other applications. 


A typical absorption spectrum of narrow size distribution wurtzite CdSe 
quantum dot is shown in [link]. A size evolving absorption spectra is shown 
in [link]. However, a complete analysis of the sample is possible only by 
also studying the fluorescence properties of CdSe. 
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Wurtzite CdSe quantum dot. 

Adapted from X. Zhong, Y. 

Feng, and Y. Zhang, J. Phys. 
Chem. C, 2007, 111, 526. 
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Size evolving absorption 
spectra of CdSe quantum dots. 


Cadmium telluride (CdTe) 


Cadmium telluride has a band gap of 1.44 eV (860 nm) and as such it 
absorbs in the infrared region. Like CdSe, the sizes of CdTe can be 
engineered to have different band edges and thus, different absorption 
spectra as a function of wavelength. A typical CdTe spectra is shown in 
[link]. Due to the small bandgap energy of CdTe, it can be used in tandem 
with CdSe to absorb in a greater part of the solar spectrum. 
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Size evolving absorption 
spectra of CdTe quantum dots 
from 3 nm to 7 nm. Adapted 
from C. Qi-Fan, W. Wen-Xing, 
G. Ying-Xin, L. Meng-Ying, X. 
Shu-Kun and Z. Xiu-Juan, 
Chin. J. Anal. Chem., 2007, 35, 
135: 


Other Group 12-16 semiconductor systems 


[link] shows the bulk band gap of other Group 12-16 semiconductor 
systems. The band gap of ZnS falls in the UV region, while those of ZnSe, 
CdS, and ZnTe fall in the visible region. 


Material Band gap (eV) Wavelength (nm) 
ZnS 3.61 343.2 
ZnSe 2.69 460.5 
ZnTe 2.39 518.4 
CdS 2.49 497.5 
CdSe 1.74 712.1 
CdTe 1.44 860.3 


Bulk band gaps of different Group 12-16 semiconductors. 


Heterostructures of Group 12-16 semiconductor systems 


It is often desirable to have a combination of two Group 12-16 
semiconductor system quantum heterostructures of different shapes like 
dots and tetrapods, for applications in solar cells, bio-markers, etc. Some of 
the most interesting systems are ZnS shell-CdSe core systems, such as the 
CdSe/CdS rods and tetrapods. 


[link] shows a typical absorption spectra of CdSe-ZnS core-shell system. 
This system is important because of the drastically improved fluorescence 
properties because of the addition of a wide band gap ZnS shell than the 
core CdSe. In addition with a ZnS shell, CdSe becomes bio-compatible. 
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Absorption spectra of CdSe 
core, ZnS shell. Adapted from 
C. Qing-Zhu, P. Wang, X. Wang 
and Y. Li, Nanoscale Res. Lett., 
2008, 3, 213. 


A CdSe seed, CdS arm nanorods system is also interesting. Combining 
CdSe and CdS in a single nanostructure creates a material with a mixed 
dimensionality where holes are confined to CdSe while electrons can move 
freely between CdSe and CdS phases. 
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Optical Characterization of Group 12-16 (II-VI) Semiconductor 
Nanoparticles by Fluorescence Spectroscopy 


Group 12-16 semiconductor nanocrystals when exposed to light of a 
particular energy absorb light to excite electrons from the ground state to 
the excited state, resulting in the formation of an electron-hole pair (also 
known as excitons). The excited electrons relax back to the ground state, 
mainly through radiative emission of energy in the form of photons. 


Quantum dots (QD) refer to nanocrystals of semiconductor materials where 
the size of the particles is comparable to the natural characteristic separation 
of an electron-hole pair, otherwise known as the exciton Bohr radius of the 
material. In quantum dots, the phenomenon of emission of photons 
associated with the transition of electrons from the excited state to the 
ground state is called fluorescence. 


Fluorescence spectroscopy 


Emission spectroscopy, in general, refers to a characterization technique 
that measures the emission of radiation by a material that has been excited. 
Fluorescence spectroscopy is one type of emission spectroscopy which 
records the intensity of light radiated from the material as a function of 
wavelength. It is a nondestructive characterization technique. 


After an electron is excited from the ground state, it needs to relax back to 
the ground state. This relaxation or loss of energy to return to the ground 
state, can be achieved by a combination of non-radiative decay (loss of 
energy through heat) and radiative decay (loss of energy through light). 
Non-radiative decay by vibrational modes typically occurs between energy 
levels that are close to each other. Radiative decay by the emission of light 
occurs when the energy levels are far apart like in the case of the band gap. 
This is because loss of energy through vibrational modes across the band 
gap can result in breaking the bonds of the crystal. This phenomenon is 
shown in [link]. 
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Emission of luminescence photon for Group 12-16 
semiconductor quantum dot. 


The band gap of Group 12-16 semiconductors is in the UV-visible region. 
Thus, the wavelength of the emitted light as a result of radiative decay is 
also in the visible region, resulting in fascinating fluorescence properties. 


A fluorimeter is a device that records the fluorescence intensity as a 
function of wavelength. The fluorescence quantum yield can then be 
calculated by the ratio of photons absorbed to photons emitted by the 
system. The quantum yield gives the probability of the excited state getting 
relaxed via fluorescence rather than by any other non-radiative decay. 


Difference between fluorescence and phosphorescence 


Photoluminescence is the emission of light from any material due to the 
loss of energy from excited state to ground state. There are two main types 
of luminescence — fluorescence and phosphorescence. Fluorescence is a fast 
decay process, where the emission rate is around 10° s"! and the lifetime is 
around 1079 - 10°” s. Fluorescence occurs when the excited state electron 
has an opposite spin compared to the ground state electrons. From the laws 
of quantum mechanics, this is an allowed transition, and occurs rapidly by 
emission of a photon. Fluorescence disappears as soon as the exciting light 
source is removed. 


Phosphorescence is the emission of light, in which the excited state electron 
has the same spin orientation as the ground state electron. This transition is 
a forbidden one and hence the emission rates are slow (10? - 10° s“!). So the 
phosphorescence lifetimes are longer, typically seconds to several minutes, 
while the excited phosphors slowly returned to the ground state. 
Phosphorescence is still seen, even after the exciting light source is 
removed. Group 12-16 semiconductor quantum dots exhibit fluorescence 
properties when excited with ultraviolet light. 


Instrumentation 


The working schematic for the fluorometer is shown in [link]. 
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Schematic of fluorometer. 


The light source 


The excitation energy is provided by a light source that can emit 
wavelengths of light over the ultraviolet and the visible range. Different 
light sources can be used as excitation sources such as lasers, xenon arcs 
and mercury-vapor lamps. The choice of the light source depends on the 
sample. A laser source emits light of a high irradiance at a very narrow 
wavelength interval. This makes the need for the filter unnecessary, but the 
wavelength of the laser cannot be altered significantly. The mercury vapor 
lamp is a discrete line source. The xenon arc has a continuous emission 
spectrum between the ranges of 300 - 800 nm. 


The diffraction grating and primary filter 


The diffraction grating splits the incoming light source into its component 
wavelengths ([link]). The monochromator can then be adjusted to choose 
with wavelengths to pass through. Following the primary filter, specific 
wavelengths of light are irradiated onto the sample 


Sample cell and sample preparation 


A proportion of the light from the primary filter is absorbed by the sample. 
After the sample gets excited, the fluorescent substance returns to the 
ground state, by emitting a longer wavelength of light in all directions 
({link]). Some of this light passes through a secondary filter. For liquid 
samples, a square cross section tube sealed at one end and all four sides 
clear, is used as a sample cell. The choice of cuvette depends on three 
factors: 


1. Type of solvent - For aqueous samples, specially designed rectangular 
quartz, glass or plastic cuvettes are used. For organic samples glass 
and quartz cuvettes are used. 

2. Excitation wavelength — Depending on the size and thus, bandgap of 
the Group 12-16 semiconductor nanoparticles, different excitation 
wavelengths of light are used. Depending on the excitation 
wavelength, different materials are used ([link]). 


Cuvette Wavelength (nm) 


Visible only glass 380 - 780 
Visible only plastic 380 - 780 
UV plastic 220 - 780 
Quartz 200 - 900 


Cuvette materials and their wavelengths. 


3. Cost — Plastic cuvettes are the least expensive and can be discarded 
after use. Though quartz cuvettes have the maximum utility, they are 
the most expensive, and need to reused. Generally, disposable plastic 
cuvettes are used when speed is more important than high accuracy. 
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cuvette for 
fluorescence 
spectroscopy 


The cuvettes have a 1 cm path length for the light ((link]). The best cuvettes 
need to be very clear and have no impurities that might affect the 


spectroscopic reading. Defects on the cuvette, such as scratches, can scatter 
light and hence should be avoided. Since the specifications of a cuvette are 
the same for both, the UV-visible spectrophotometer and fluorimeter, the 
Same cuvette that is used to measure absorbance can be used to measure the 
fluorescence. For Group 12-16 semiconductor nanoparticles preparted in 
organic solvents, the clear four sided quartz cuvette is used. The sample 
solution should be dilute (absorbance <1 au), to avoid very high signal from 
the sample to burn out the detector. The solvent used to disperse the 
nanoparticles should not absorb at the excitation wavelength. 


Secondary filter 


The secondary filter is placed at a 90° angle ([link]) to the original light 
path to minimize the risk of transmitted or reflected incident light reaching 
the detector. Also this minimizes the amount of stray light, and results in a 
better signal-to-noise ratio. From the secondary filter, wavelengths specific 
to the sample are passed onto the detector. 


Detector 


The detector can either be single-channeled or multichanneled ([link]). The 
single-channeled detector can only detect the intensity of one wavelength at 
a time, while the multichanneled detects the intensity at all wavelengths 
simultaneously, making the emission monochromator or filter unnecessary. 
The different types of detectors have both advantages and disadvantages. 


Output 


The output is the form of a plot of intensity of emitted light as a function of 
wavelength as shown in [link]. 
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Emission spectra of CdSe 
quantum dot. 


Analysis of data 


The data obtained from fluorimeter is a plot of fluorescence intensity as a 
function of wavelength. Quantitative and qualitative data can be obtained 
by analysing this information. 


Quantitative information 


From the fluorescence intensity versus wavelength data, the quantum yield 
(®,) of the sample can be determined. Quantum yield is a measure of the 
ratio of the photons absorbed with respect to the photons emitted. It is 
important for the application of Group 12-16 semiconductor quantum dots 
using their fluorescence properties, for e.g., bio-markers. 


The most well-known method for recording quantum yield is the 
comparative method which involves the use of well characterized standard 
solutions. If a test sample and a standard sample have similar absorbance 
values at the same excitation wavelength, it can be assumed that the number 
of photons being absorbed by both the samples is the same. This means that 
a ratio of the integrated fluorescence intensities of the test and standard 


sample measured at the same excitation wavelength will give a ratio of 
quantum yields. Since the quantum yield of the standard solution is known, 
the quantum yield for the unknown sample can be calculated. 


A plot of integrated fluorescence intensity versus absorbance at the 
excitation wavelength is shown in [link]. The slope of the graphs shown in 
[link] are proportional to the quantum yield of the different samples. 
Quantum yield is then calculated using [link], where subscripts ST denotes 
standard sample and X denotes the test sample; QY is the quantum yield; RI 
is the refractive index of the solvent. 
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Integrated fluoresncene intensity as a function of 
absorbance. 


Equation: 


QY, = slopey (RIx)* 
OY sr 


slopecy (Rl gq)? 


Take the example of [link]. If the same solvent is used in both the sample 
and the standard solution, the ratio of quantum yields of the sample to the 


standard is given by [link]. If the quantum yield of the standard is known to 
0.95, then the quantum yield of the test sample is 0.523 or 52.3%. 
Equation: 
OY = Tat 
OY, 2.56 


The assumption used in the comparative method is valid only in the Beer- 
Lambert law linear regime. Beer-Lambert law states that absorbance is 
directly proportional to the path length of light travelled within the sample, 
and concentration of the sample. The factors that affect the quantum yield 
measurements are the following: 


e Concentration — Low concentrations should be used (absorbance < 
0.2 a.u.) to avoid effects such as self quenching. 

e Solvent — It is important to take into account the solvents used for the 
test and standard solutions. If the solvents used for both are the same 
then the comparison is trivial. However, if the solvents in the test and 
standard solutions are different, this difference needs to be accounted 
for. This is done by incorporating the solvent refractive indices in the 
ratio calculation. 

e Standard samples — The standard samples should be characterized 
thoroughly. In addition, the standard sample used should absorb at the 
excitation wavelength of the test sample. 

e Sample preparation — It is important that the cuvettes used are clean, 
scratch free and clear on all four sides. The solvents used must be of 
spectroscopic grade and should not absorb in the wavelength range. 

e Slit width — The slit widths for all measurements must be kept 
constant. 


The quantum yield of the Group 12-16 semiconductor nanoparticles are 
affected by many factors such as the following. 


e Surface defects — The surface defects of semiconductor quantum dots 
occur in the form of unsatisfied valencies. Thus resulting in unwanted 
recombinations. These unwanted recombinations reduce the loss of 
energy through radiative decay, and thus reducing the fluorescence. 


¢ Surface ligands — If the surface ligand coverage is a 100%, there is a 
smaller chance of surface recombinations to occur. 

¢ Solvent polarity — If the solvent and the ligand have similar solvent 
polarities, the nanoparticles are more dispersed, reducing the loss of 
electrons through recombinations. 


Qualitative Information 


Apart from quantum yield information, the relationship between intensity of 
fluorescence emission and wavelength, other useful qualitative information 
such as size distribution, shape of the particle and presence of surface 
defects can be obtained. 


As shown in [link], the shape of the plot of intensity versus wavelength is a 
Gaussian distribution. In [link], the full width at half maximum (FWHM) is 
given by the difference between the two extreme values of the wavelength 
at which the photoluminescence intensity is equal to half its maximum 
value. From the full width half max (FWHM) of the fluorescence intensity 
Gaussian distribution, it is possible to determine qualitatively the size 
distribution of the sample. For a Group 12-16 quantum dot sample if the 
FWHM is greater than 30, the system is very polydisperse and has a large 
size distribution. It is desirable for all practical applications for the FWHM 
to be lesser than 30. 
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Emission spectra of CdSe QDs 
showing the full width half maximum 
(FWHM). 


From the FWHM of the emission spectra, it is also possible to qualitatively 
get an idea if the particles are spherical or shaped. During the synthesis of 
the shaped particles, the thickness of the rod or the arm of the tetrapod does 
not vary among the different particles, as much as the length of the rods or 
arms changes. The thickness of the arm or rod is responsible for the 
quantum effects in shaped particles. In the case of quantum dots, the 
particle is quantum confined in all dimensions. Thus, any size distribution 
during the synthesis of quantum dots greatly affects the emission spectra. 
As a result the FWHM of rods and tetrapods is much smaller as compared 
to a quantum dot. Hence, qualitatively it is possible to differentiate between 
quantum dots and other shaped particles. 


Another indication of branched structures is the decrease in the intensity of 
fluorescence peaks. Quantum dots have very high fluorescence values as 
compared to branched particles, since they are quantum confined in all 
dimensions as compared to just 1 or 2 dimensions in the case of branched 
particles. 


Fluorescence spectra of different Group 12-16 semiconductor 
nanoparticles 


The emission spectra of all Group 12-16 semiconductor nanoparticles are 
Gaussian curves as shown in [link] and [link]. The only difference between 
them is the band gap energy, and hence each of the Group 12-16 
semiconductor nanoparticles fluoresce over different ranges of wavelengths 


Cadmium selenide 


Since its bulk band gap (1.74 eV, 712 nm) falls in the visible region 
cadmium Selenide (CdSe) is used in various applications such as solar cells, 
light emitting diodes, etc. Size evolving emission spectra of cadmium 
selenide is shown in [link]. Different sized CdSe particles have different 
colored fluorescence spectra. Since cadmium and selenide are known 
carcinogens and being nanoparticles are easily absorbed into the human 
body, there is some concern regarding these particles. However, CdSe 
coated with ZnS can overcome all the harmful biological effects, making 
cadmium selenide nanoparticles one of the most popular 12-16 
semiconductor nanoparticle. 
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Size evolving CdSe emission 
spectra. Adapted from 
http://www. physics.mq.edu.au. 


A combination of the absorbance and emission spectra is shown in [link] 
for four different sized particles emitting green, yellow, orange, and red 
fluorescence. 
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Absorption and emission spectra of CdSe quantum 
dots. Adapted from G. Schmid, Nanoparticles: 
From Theory to Application, Wiley-VCH, 
Weinham (2004). 


Cadmium telluride 


Cadmium Telluride (CdTe) has a band gap of 1.44 eV and thus absorbs in 
the infra red region. The size evolving CdTe emission spectra is shown in 
[link]. 
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Size evolution spectra of CdTe 
quantum dots. 


Adding shells to QDs 


Capping a core quantum dot with a semiconductor material with a wider 
bandgap than the core, reduces the nonradiative recombination and results 
in brighter fluorescence emission. Quantum yields are affected by the 
presences of free surface charges, surface defects and crystal defects, which 
results in unwanted recombinations. The addition of a shell reduces the 
nonradiative transitions and majority of the electrons relax radiatively to the 
valence band. In addition, the shell also overcomes some of the surface 
defects. 


For the CdSe-core/ZnS-shell systems exhibit much higher quantum yield as 
compared to core CdSe quantum dots as seen in [link]. 
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Carbon Nanomaterials 


Introduction 


Although nanomaterials had been known for many years prior to the report 
of Cgp the field of nanoscale science was undoubtedly founded upon this 
seminal discovery. Part of the reason for this explosion in nanochemistry is 
that while carbon materials range from well-defined nano sized molecules 
(i.e., Cgq) to tubes with lengths of hundreds of microns, they do not exhibit 
the instabilities of other nanomaterials as a result of the very high activation 
barriers to their structural rearrangement. As a consequence they are highly 
stable even in their unfunctionalized forms. Despite this range of carbon 
nanomaterials possible they exhibit common reaction chemistry: that of 
organic chemistry. 


The previously unknown allotrope of carbon: Cgg, was discovered in 1985, 
and in 1996, Curl, Kroto, and Smalley were awarded the Nobel Prize in 
Chemistry for the discovery. The other allotropes of carbon are graphite 
(sp*) and diamond (sp?). Cg, commonly known as the “buckyball” or 
“Buckminsterfullerene”, has a spherical shape comprising of highly 
pyramidalized sp* carbon atoms. The Ceo variant is often compared to the 
typical soccer football, hence buckyball. However, confusingly, this term is 
commonly used for higher derivatives. Fullerenes are similar in sheet 
structure to graphite but they contain pentagonal (or sometimes heptagonal) 
rings that prevent the sheet from being planar. The unusual structure of Cgg 
led to the introduction of a new class of molecules known as fullerenes, 
which now constitute the third allotrope of carbon. Fullerenes are 
commonly defined as “any of a class of closed hollow aromatic carbon 
compounds that are made up of twelve pentagonal and differing numbers of 
hexagonal faces.” 


The number of carbon atoms in a fullerene range from Cgp to C79, C76, and 
higher. Higher order fullerenes include carbon nanotubes that can be 
described as fullerenes that have been stretched along a rotational axis to 
form a tube. As a consequence of differences in the chemistry of fullerenes 
such as Cgg and C79 as compared to nanotubes, these will be dealt with 
separately herein. In addition there have also been reports of nanohorns and 


nanofibers, however, these may be considered as variations on the general 
theme. It should be noted that fullerenes and nanotubes have been shown to 
be in flames produced by hydrocarbon combustion. Unfortunately, these 
naturally occurring varieties can be highly irregular in size and quality, as 
well as being formed in mixtures, making them unsuitable for both research 
and industrial applications. 


Fullerenes 


Carbon-60 (Cgp) is probably the most studied individual type of 
nanomaterial. The spherical shape of Cgg is constructed from twelve 
pentagons and twenty hexagons and resembles a soccer ball ([link]a). The 
next stable higher fullerene is C7p ({link]b) that is shaped like a rugby or 
American football. The progression of higher fullerenes continues in the 
sequence C74, C76, C7, etc. The structural relationship between each 
involves the addition of six membered rings. Mathematically (and 
chemically) two principles define the existence of a stable fullerene, i.e., 
Euler’s theorem and isolated pentagon rule (IPR). Euler’s theorem states 
that for the closure of each spherical network, n (n = 2) hexagons and 12 
pentagons are required while the IPR says no two pentagons may be 
connected directly with each other as destabilization is caused by two 
adjacent pentagons. 


(2) (b) 


Molecular structures of (a) Cgp and (b) C7. 


Although fullerenes are composed of sp? carbons in a similar manner to 
graphite, fullerenes are soluble in various common organic solvents. Due to 
their hydrophobic nature, fullerenes are most soluble in CS (Cgg = 7.9 
mg/mL) and toluene (Cgp = 2.8 mg/mL). Although fullerenes have a 
conjugated system, their aromaticity is distinctive from benzene that has all 
C-C bonds of equal lengths, in fullerenes two distinct classes of bonds exist. 
The shorter bonds are at the junctions of two hexagons ([6, 6] bonds) and 
the longer bonds at the junctions of a hexagon and a pentagon ([5,6] bonds). 
This difference in bonding is responsible for some of the observed 
reactivity of fullerenes. 


Synthesis of fullerenes 


The first observation of fullerenes was in molecular beam experiments at 
Rice University. Subsequent studies demonstrated that Cg¢p it was relatively 
easy to produce grams of fullerenes. Although the synthesis is relatively 
straightforward fullerene purification remains a challenge and determines 
fullerene’s commercial price. The first method of production of measurable 
quantities of fullerenes used laser vaporization of carbon in an inert 
atmosphere, but this produced microscopic amounts of fullerenes. 
Laboratory scales of fullerene are prepared by the vaporization of carbon 
rods in a helium atmosphere. Commercial production ordinarily employs a 
simple ac or dc arc. The fullerenes in the black soot collected are extracted 
in toluene and purified by liquid chromatography. The magenta Cgg comes 
off the column first, followed by the red Cyo, and other higher fullerenes. 
Even though the mechanism of a carbon arc differs from that of a resistively 
heated carbon rod (because it involves a plasma) the He pressure for 
optimum Cg formation is very similar. 


A ratio between the mass of fullerenes and the total mass of carbon soot 
defines fullerene yield. The yields determined by UV-Vis absorption are 
approximately 40%, 10-15%, and 15% in laser, electric arc, and solar 
processes. Interestingly, the laser ablation technique has both the highest 
yield and the lowest productivity and, therefore, a scale-up to a higher 


power is costly. Thus, fullerene commercial production is a challenging 
task. The world's first computer controlled fullerene production plant is now 
operational at the MER Corporation, who pioneered the first commercial 
production of fullerene and fullerene products. 


Endohedral fullerenes 


Endohedral fullerenes are fullerenes that have incorporated in their inner 
sphere atoms, ions or clusters. Endohedral fullerenes are generally divided 
into two groups: endohedral metallofullerenes and non-metal doped 
fullerenes. The first endohedral metallofullerenes was called La@Cgo. The 
@ sign in the name reflects the notion of a small molecule trapped inside a 
shell. 


Doping fullerenes with metals takes place in-situ during the fullerene 
synthesis in an arc reactor or via laser evaporation. A wide range of metals 
have been encased inside a fullerene, i.e., Sc, Y, La, Ce, Ba, Sr, K, U, Zr, 
and Hf. Unfortunately, the synthesis of endohedral metallofullerenes is 
unspecific because in addition a high yield of unfilled fullerenes, 
compounds with different cage sizes are prepared (e.g., La@Cgo or 
La@Cg>). A characteristic of endohedral metallofullerenes is that electrons 
will transfer from the metal atom to the fullerene cage and that the metal 
atom takes a position off-center in the cage. The size of the charge transfer 
is not always simple to determine, but it is usually between 2 and 3 units 
(e.g., Lay>@Cgo) but can be as high as 6 electrons (e.g., ScsN@Cgo). These 
anionic fullerene cages are very stable molecules and do not have the 
reactivity associated with ordinary empty fullerenes (see below). This lack 
of reactivity is utilized in a method to purify endohedral metallofullerenes 
from empty fullerenes. 


The endohedral He@Cgy and Ne@Cgp form when Ceo is exposed to a 
pressure of around 3 bar of the appropriate noble gases. Under these 
conditions it was possible to dope 1 in every 650,000 Cgg cages with a 
helium atom. Endohedral complexes with He, Ne, Ar, Kr and Xe as well as 
numerous adducts of the He@Cgy compound have also been proven with 
operating pressures of 3000 bars and incorporation of up to 0.1 % of the 


noble gases. The isolation of N@Cgop, N@C7p and P@Cegp is very unusual 
and unlike the metal derivatives no charge transfer of the pnictide atom in 
the center to the carbon atoms of the cage takes place. 


Chemically functionalized fullerenes 


Although fullerenes have a conjugated aromatic system all the carbons are 
quatemary (i.e., containing no hydrogen), which results in making many of 
the characteristic substitution reactions of planar aromatics impossible. 
Thus, only two types of chemical transformations exist: redox reactions and 
addition reactions. Of these, addition reactions have the largest synthetic 
value. Another remarkable feature of fullerene addition chemistry is the 
thermodymics of the process. Since the sp? carbon atoms in a fullerene are 
paramidalized there is significant strain energy. For example, the strain 
energy in Cg is ca 8 kcal/mol, which is 80% of its heat of formation. So the 
relief of this strain energy leading to sp? hybridized C atoms is the major 
driving force for addition reactions ([link]). As a consequence, most 
additions to fullerenes are exothermic reactions. 
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Cyclic voltammetry (CV) studies show that Cgg can be reduced and 
oxidized reversibly up to 6 electrons with one-electron transfer processes. 
Fulleride anions can be generated by electrochemical method and then be 


used to synthesize covalent organofullerene derivatives. Alkali metals can 
chemically reduce fullerene in solution and solid state to form M,Cgg (x = 3 
- 6). Cg can also be reduced by less electropositive metals like mercury to 
form Cg” and Cg’. In addition, salts can also be synthesized with organic 
molecules, for example [TDAE"*][Cgq_] possesses interesting electronic and 
magnetic behavior. 


Geometric and electronic analysis predicted that fullerene behaves live an 
electro-poor conjugated polyolefin. Indeed Cgp and C79 undergo a range of 
nucleophilic reactions with carbon, nitrogen, phosphorous and oxygen 
nucleophiles. C60 reacts readily with organolithium and Grignard 
compounds to form alkyl, phenyl or alkanyl fullerenes. Possibly the most 
widely used additions to fullerene is the Bingel reaction ([link]), where a 
carbon nucleophile, generated by deprotonation of a-halo malonate esters or 
ketones, is added to form a cyclopropanation product. The a-halo esters and 
ketones can also be generated in situ with Ip or CBr, and a weak base as 
1,8-diazabicyclo[5.4.0]unde-7ene (DBU). The Bingel reaction is considered 
one of the most versatile and efficient methods to functionalize Cgpo. 
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Bingel reaction of Cgg with 2- 
bromoethylmalonate. 


Cycloaddition is another powerful tool to functionalize fullerenes, in 
particular because of its selectivity with the 6,6 bonds, limiting the possible 
isomers ([link]). The dienophilic feature of the [6,6] double bonds of Cgp 
enables the molecule to undergo various cycloaddition reactions in which 


the monoadducts can be generated in high yields. The best studies 
cycloadditon reactions of fullerene are [3+2] additions with 
diazoderivatives and azomethine ylides (Prato reactions). In this reaction, 
azomethine ylides can be generated in situ from condensation of a-amino 
acids with aldehydes or ketones, which produce 1,3 dipoles to further react 
with Cgp in good yields ({link]). Hundreds of useful building blocks have 
been generated by those two methods. The Prato reactions have also been 
successfully applied to carbon nanotubes. 


Geometrical shapes built onto a [6,6] ring 
junction: a) open, b) four-membered ring, c) 
five-membered ring, and d) six-membered ring. 


CH; 


Prato reaction of Cgq with N-methyglycine and 
paraformaldehyde. 


The oxidation of fullerenes, such as Cgo, has been of increasing interest 
with regard to applications in photoelectric devices, biological systems, and 
possible remediation of fullerenes. The oxidation of Cgg to CggO, (n = 1, 2) 
may be accomplished by photooxidation, ozonolysis, and epoxidation. With 
each of these methods, there is a limit to the isolable oxygenated product, 
CeO, with n < 3. Highly oxygenated fullerenes, Cg gO, with 3 <n < 9, have 
been prepared by the catalytic oxidation of Cgg with REMeO3/H20>. 


Carbon nanotubes 


A key breakthrough in carbon nanochemistry came in 1993 with the report 
of needle-like tubes made exclusively of carbon. This material became 
known as carbon nanotubes (CNTs). There are several types of nanotubes. 
The first discovery was of multi walled tubes (MWNTs) resembling many 
pipes nested within each other. Shortly after MWNTs were discovered 
single walled nanotubes (SWNTs) were observed. Single walled tubes 
resemble a single pipe that is potentially capped at each end. The properties 
of single walled and multi walled tubes are generally the same, although 
single walled tubes are believed to have superior mechanical strength and 
thermal and electrical conductivity; it is also more difficult to manufacture 
them. 


Single walled carbon nanotubes (SWNTs) are by definition fullerene 
materials. Their structure consists of a graphene sheet rolled into a tube and 
capped by half a fullerene ([link]). The carbon atoms in a SWNT, like those 
in a fullerene, are sp2 hybridized. The structure of a nanotube is analogous 
to taking this graphene sheet and rolling it into a seamless cylinder. The 
different types of SWNTs are defined by their diameter and chirality. Most 
of the presently used single-wall carbon nanotubes have been synthesized 
by the pulsed laser vaporization method, however, increasingly SWNTs are 
prepared by vapor liquid solid catalyzed growth. 


(b) 


(c) 


Structure of single walled carbon nanotubes 
(SWNTs) with (a) armchair, (b) zig-zag, and (c) 
chiral chirality. 


The physical properties of SWNTs have made them an extremely attractive 
material for the manufacturing of nano devices. SWNTs have been shown 
to be stronger than steel as estimates for the Young’s modulus approaches 1 
Tpa. Their electrical conductance is comparable to copper with anticipate 
current densities of up to 10'° A/cm? and a resistivity as low as 0.34 x 104 
Q.cm at room temperatures. Finally, they have a high thermal conductivity 
(3000 - 6000 W.m/K). 


The electronic properties of a particular SWNT structure are based on its 
chirality or twist in the structure of the tube which is defined by its n,m 
value. The values of n and m determine the chirality, or "twist" of the 
nanotube. The chirality in turn affects the conductance of the nanotube, its 
density, its lattice structure, and other properties. A SWNT is considered 
metallic if the value n-m is divisible by three. Otherwise, the nanotube is 
semi-conducting. The external environment also has an effect on the 


conductance of a tube, thus molecules such as O» and NH3 can change the 
overall conductance of a tube, while the presence of metals have been 
shown to significantly effect the opto-electronic properties of SWNTs. 


Multi walled carbon nanotubes (MWNTs) range from double walled NTs, 
through many-walled NTs ([{link]) to carbon nanofibers. Carbon nanofibers 
are the extreme of multi walled tubes ([link]) and they are thicker and 
longer than either SWNTs or MWNTs, having a cross-sectional of ca. 500 
A? and are between 10 to 100 pm in length. They have been used 
extensively in the construction of high strength composites. 


TEM image of an individual 
multi walled carbon nanotube 
(MWNTs). Copyright of 
Nanotech Innovations. 


A 


“ 


Magn Det WD -}—-—————-{ 2um 


Acc.V Spo e 
30.0kV 3.0 15000x SE 18.6 Hivac 


SEM image of vapor grown 
carbon nanofibers. 


Synthesis of carbon nanotubes 


A range of methodologies have been developed to produce nanotubes in 
sizeable quantities, including arc discharge, laser ablation, high pressure 
carbon monoxide (HiPco), and vapor liquid solid (VLS) growth. All these 
processes take place in vacuum or at low pressure with a process gases, 
although VLS growth can take place at atmospheric pressure. Large 
quantities of nanotubes can be synthesized by these methods; advances in 
catalysis and continuous growth processes are making SWNTs more 
commercially viable. 


The first observation of nanotubes was in the carbon soot formed during the 
arc discharge production of fullerenes. The high temperatures caused by the 
discharge caused the carbon contained in the negative electrode to sublime 
and the CNTs are deposited on the opposing electrode. Tubes produced by 
this method were initially multi walled tubes (MWNTs). However, with the 
addition of cobalt to the vaporized carbon, it is possible to grow single 
walled nanotubes. This method it produces a mixture of components, and 
requires further purification to separate the CNTs from the soot and the 
residual catalytic metals. Producing CNTs in high yield depends on the 


uniformity of the plasma arc, and the temperature of the deposit forming on 
the carbon electrode. 


Higher yield and purity of SWNTs may be prepared by the use of a dual- 
pulsed laser. SWNTs can be grown in a 50% yield through direct 
vaporization of a Co/Ni doped graphite rod with a high-powered laser in a 
tube furnace operating at 1200 °C. The material produced by this method 
appears as a mat of “ropes”, 10 - 20 nm in diameter and up to 100 pm or 
more in length. Each rope consists of a bundle of SWNTs, aligned along a 
common axis. By varying the process parameters such as catalyst 
composition and the growth temperature, the average nanotube diameter 
and size distribution can be varied. Although arc-discharge and laser 
vaporization are currently the principal methods for obtaining small 
quantities of high quality SWNTs, both methods suffer from drawbacks. 
The first is that they involve evaporating the carbon source, making scale- 
up on an industrial level difficult and energetically expensive. The second 
issue relates to the fact that vaporization methods grow SWNTs in highly 
tangled forms, mixed with unwanted forms of carbon and/or metal species. 
The SWNTs thus produced are difficult to purify, manipulate, and assemble 
for building nanotube-device architectures for practical applications. 


In order to overcome some of the difficulties of these high-energy 
processes, the chemical catalysis method was developed in which a 
hydrocarbon feedstock is used in combination with a metal catalyst. The 
catalyst is typically, but not limited to iron, colbalt, or iron/molybdenun,, it 
is heated under reducing conditions in the presence of a suitable carbon 
feedstock, e.g., ethylene. This method can be used for both SWNTs and 
MWNTs; the formation of each is controlled by the identity of the catalyst 
and the reaction conditions. A convenient laboratory scale apparatus is 
available from Nanotech Innovations, Inc., for the synthesis of highly 
uniform, consistent, research sample that uses pre-weighed catalyst/carbon 
source ampoules. This system, allows for 200 mg samples of MWNTs to be 
prepared for research and testing. The use of CO as a feedstock, in place of 
a hydrocarbon, led to the development of the high-pressure carbon 
monoxide (HiPco) procedure for SWNT synthesis. By this method, it is 
possible to produce gram quantities of SWNTs, unfortunately, efforts to 
scale beyond that have not met with complete success. 


Initially developed for small-scale investigations of catalyst activity, vapor 
liquid solid (VLS) growth of nanotubes has been highly studied, and now 
shows promise for large-scale production of nanotubes. Recent approaches 
have involved the use of well-defined nanoparticle or molecular precursors 
and many different transition metals have been employed, but iron, nickel, 
and cobalt remain to be the focus of most research. The nanotubes grow at 
the sites of the metal catalyst; the carbon-containing gas is broken apart at 
the surface of the catalyst particle, and the carbon is transported to the 
edges of the particle, where it forms the nanotube. The length of the tube 
grown in surface supported catalyst VLS systems appears to be dependent 
on the orientation of the growing tube with the surface. By properly 
adjusting the surface concentration and aggregation of the catalyst particles 
it is possible to synthesize vertically aligned carbon nanotubes, i.e., as a 
Carpet perpendicular to the substrate. 


Of the various means for nanotube synthesis, the chemical processes show 
the greatest promise for industrial scale deposition in terms of its price/unit 
ratio. There are additional advantages to the VLS growth, which unlike the 
other methods is capable of growing nanotubes directly on a desired 
substrate. The growth sites are controllable by careful deposition of the 
catalyst. Additionally, no other growth methods have been developed to 
produce vertically aligned SWNTs. 


Chemical functionalization of carbon nanotubes 


The limitation of using carbon nanotubes in any practical applications has 
been its solubility; for example SWNTs have little to no solubility in most 
solvent due to the aggregation of the tubes. Aggregation/roping of 
nanotubes occurs as a result of the high van der Waals binding energy of ca. 
500 eV per mm of tube contact. The van der Waals force between the tubes 
is so great, that it take tremendous energy to pry them apart, making it very 
to make combination of nanotubes with other materials such as in 
composite applications. The functionalization of nanotubes, i.e., the 
attachment of “chemical functional groups” provides the path to overcome 
these barriers. Functionalization can improve solubility as well as 
processibility, and has been used to align the properties of nanotubes to 


those of other materials. The clearest example of this is the ability to 
solubilize nanotubes in a variety of solvents, including water. It is important 
when discussing functionalization that a distinction is made between 
covalent and non-covalent functionalization. 


Current methods for solubilizing nanotubes without covalent 
functionalization include highly aromatic solvents, super acids, polymers, 
or surfactants. Non-covalent “functionalization” is generally on the concept 
of supramolecular interactions between the SWNT and some 
macromolecule as a result of various adsorption forces, such as van der 
Waals’ and m-stacking interactions. The chemical speciation of the nanotube 
itself is not altered as a result of the interaction. In contrast, covalent 
functionalization relies on the chemical reaction at either the sidewall or 
end of the SWNT. As may be expected the high aspect ratio of nanotubes 
means that sidewall functionalization is much more important than the 
functionalization of the cap. Direct covalent sidewall functionalization is 
associated with a change of hybridization from sp? to sp* and a 
simultaneous loss of conjugation. An alternative approach to covalent 
functionalization involves the reaction of defects present (or generated) in 
the structure of the nanotube. Defect sites can be the open ends and holes in 
the sidewalls, and pentagon and heptagon irregularities in the hexagon 
graphene framework (often associated with bends in the tubes). All these 
functionalizations are exohedral derivatizations. Taking the hollow structure 
of nanotubes into consideration, endohedral functionalization of SWNTs is 
possible, i.e., the filling of the tubes with atoms or small molecules. It is 
important to note that covalent functionalization methods have one problem 
in common: extensive covalent functionalization modifies SWNT 
properties by disrupting the continuous m—-system of SWNTSs. 


Various applications of nanotubes require different, specific modification to 
achieve desirable physical and chemical properties of nanotubes. In this 
regard, covalent functionalization provides a higher degree of fine-tuning 
the chemistry and physics of SWNTs than non-covalent functionalization. 
Until now, a variety of methods have been used to achieve the 
functionalization of nanotubes ((link]). 
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Schematic description of various covalent 
functionalization strategies for SWNTs. 


Taking chemistry developed for Cgg, SWNTs may be functionalized using 
1,3 dipolar addition of azomethine ylides. The functionalized SWNTs are 
soluble in most common organic solvents. The azomethine ylide 
functionalization method was also used for the purification of SWNTs. 
Under electrochemical conditions, aryl diazonium salts react with SWNTs 
to achieve functionalized SWNTs, alternatively the diazonium ions may be 
generated in-situ from the corresponding aniline, while a solvent free 
reaction provides the best chance for large-scale functionalization this way. 
In each of these methods it is possible to control the amount of 
functionalization on the tube by varying reaction times and the reagents 
used; functionalization as high as 1 group per every 10 - 25 carbon atoms is 
possible. 


Organic functionalization through the use of alkyl halides, a radical 
pathway, on tubes treated with lithium in liquid ammonia offers a simple 
and flexible route to a range of functional groups. In this reaction, 
functionalization occurs on every 17 carbons. Most success has been found 
when the tubes are dodecylated. These tubes are soluble in chloroform, 
DMF, and THF. 


The addition of oxygen moieties to SWNT sidewalls can be achieved by 
treatment with acid or wet air oxidation, and ozonolysis. The direct 
epoxidation of SWNTs may be accomplished by the direct reaction with a 
peroxide reagent, or catalytically. Catalytic de-epoxidation (({link]) allows 


for the quantitative analysis of sidewall epoxide and led to the surprising 
result that previously assumed “pure” SWNTs actually contain ca. 1 oxygen 
per 250 carbon atoms. 


ReMeO3 + H909 
\ J 
ReMeO3 + PPhg 


Catalytic oxidation and de- 
epoxidation of SWNTs. 


One of the easiest functionalization routes, and a useful synthon for 
subsequent conversions, is the fluorination of SWNTs, using elemental 
fluorine. Importantly, a C:F ratios of up to 2:1 can be achieved without 
disruption of the tubular structure. The fluorinated SWNTs (F-SWNTs) 
proved to be much more soluble than pristine SWNTs in alcohols (1 mg/mL 
in iso-propanol), DMF and other selected organic solvents. Scanning 
tunneling microscopy (STM) revealed that the fluorine formed bands of 
approximately 20 nm, while calculations using DFT revealed 1,2 addition is 
more energetically preferable than 1,4 addition, which has been confirmed 
by solid state '3C NMR. F-SWNTs make highly flexible synthons and 
subsequent elaboration has been performed with organo lithium, Grignard 
reagents, and amines. 


Functionalized nanotubes can be characterized by a variety of techniques, 
such as atomic force microscopy (AFM), transmission electron microscopy 
(TEM), UV-vis spectroscopy, and Raman spectroscopy. Changes in the 


Raman spectrum of a nanotube sample can indicate if functionalization has 
occurred. Pristine tubes exhibit two distinct bands. They are the radial 
breathing mode (230 cm-!) and the tangential mode (1590 cm“!). When 
functionalized, a new band, called the disorder band, appears at ca.1350 
cm’!. This band is attributed to sp°-hybridized carbons in the tube. 
Unfortunately, while the presence of a significant D mode is consistent with 
sidewall functionalization and the relative intensity of D (disorder) mode 
versus the tangential G mode (1550 — 1600 cm‘) is often used as a measure 
of the level of substitution. However, it has been shown that Raman is an 
unreliable method for determination of the extent of functionalization since 
the relative intensity of the D band is also a function of the substituents 
distribution as well as concentration. Recent studies suggest that solid state 
13C NMR are possibly the only definitive method of demonstrating covalent 
attachment of particular functional groups. 


Coating carbon nanotubes: creating inorganic nanostructures 


Fullerenes, nanotubes and nanofibers represent suitable substrates for the 
seeding other materials such as oxides and other minerals, as well as 
semiconductors. In this regard, the carbon nanomaterial acts as a seed point 
for the growth as well as a method of defining unusual aspect ratios. For 
example, silica fibers can be prepared by a number of methods, but it is 
only through coating SWNTs that silica nano-fibers with of micron lengths 
with tens of nanometers in diameter may be prepared. 


While Cp itself does not readily seed the growth of inorganic materials, 
liquid phase deposition of oxides, such as silica, in the presence of 
fullerenol, Cgq(OH),, results in the formation of uniform oxide spheres. It 
appears the fullerenol acts as both a reagent and a physical point for 
subsequent oxide growth, and it is Ceo, or an aggregate of Cgo, that is 
present within the spherical particle. The addition of fullerenol alters the 
morphology and crystal phase of CaCO3 precipitates from aqueous solution, 
resulting in the formation of spherical features, 5-pointed flower shaped 
clusters, and triangular crystals as opposed to the usual rhombic crystals. In 
addition, the meta-stable vaterite phase is observed with the addition of 
Ceo(OH)n- 


As noted above individual SWNTs may be obtained in solution when 
encased in a cylindrical micelle of a suitable surfactant. These 
individualized nanotubes can be coated with a range of inorganic materials. 
Liquid phase deposition (LPD) appears to have significant advantages over 
other methods such as incorporating surfacted SWNTs into a preceramic 
matrix, in situ growth of the SWNT in an oxide matrix, and sol-gel 
methods. The primary advantage of LPD growth is that individual SWNTs 
may be coated rather than bundles or ropes. For example, SWNTs have 
been coated with silica by liquid phase deposition (LPD) using a 
Silica/H»SiF, solution and a surfactant-stabilized solution of SWNTs. The 
thickness of the coating is dependent on the reaction mixture concentration 
and the reaction time. The SWNT core can be removed by thermolysis 
under oxidizing conditions to leave a silica nano fiber. It is interesting to 
note that the use of a surfactant is counter productive when using MWNTs 
and VGFs, in this case surface activation of the nanotube offers the suitable 
growth initiation. Pre-oxidation of the MWNT or VGF allows for uniform 
coatings to be deposited. The coated SWNTs, MWNTs, and VGFs can be 
subsequently reacted with suitable surface reagents to impart miscibility in 
aqueous solutions, guar gels, and organic matrixes. In addition to simple 
oxides, coated nanotubes have been prepared with minerals such as 
carbonates and semiconductors. 
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Graphene 


Introduction 


Graphene is a one-atom-thick planar sheet of sp*-bonded carbon atoms that 
are densely packed in a honeycomb crystal lattice ({link]). The name comes 
from “graphite” and “alkene”; graphite itself consists of many graphene 
sheets stacked together. 


Idealized structure of a single graphene 
sheet. 


Single-layer graphene nanosheets were first characterized in 2004, prepared 
by mechanical exfoliation (the “scotch-tape” method) of bulk graphite. 
Later graphene was produced by epitaxial chemical vapor deposition on 
silicon carbide and nickel substrates. Most recently, graphene nanoribbons 
(GNRs) have been prepared by the oxidative treatment of carbon nanotubes 
and by plasma etching of nanotubes embedded in polymer films. 


Physical properties of graphene 


Graphene has been reported to have a Young’s modulus of 1 TPa and 
intrinsic strength of 130 GP; similar to single walled carbon nanotubes 
(SWNTs). The electronic properties of graphene also have some similarity 
with carbon nanotubes. Graphene is a zero-bandgap semiconductor. 
Electron mobility in graphene is extraordinarily high (15,000 cm?/V.s at 
room temperature) and ballistic electron transport is reported to be on 
length scales comparable to that of SWNTs. One of the most promising 
aspects of graphene involves the use of GNRs. Cutting an individual 
graphene layer into a long strip can yield semiconducting materials where 
the bandgap is tuned by the width of the ribbon. 


While graphene’s novel electronic and physical properties guarantee this 
material will be studied for years to come, there are some fundamental 
obstacles yet to overcome before graphene based materials can be fully 
utilized. The aforementioned methods of graphene preparation are effective; 
however, they are impractical for large-scale manufacturing. The most 
plentiful and inexpensive source of graphene is bulk graphite. Chemical 
methods for exfoliation of graphene from graphite provide the most realistic 
and scalable approach to graphene materials. 


Graphene layers are held together in graphite by enormous van der Waals 
forces. Overcoming these forces is the major obstacle to graphite 
exfoliation. To date, chemical efforts at graphite exfoliation have been 
focused primarily on intercalation, chemical derivatization, thermal 
expansion, oxidation-reduction, the use of surfactants, or some combination 
of these. 


Graphite oxide 


Probably the most common route to graphene involves the production of 
graphite oxide (GO) by extremely harsh oxidation chemistry. The methods 
of Staudenmeier or Hummers are most commonly used to produce GO, a 
highly exfoliated material that is dispersible in water. The structure of GO 
has been the subject of numerous studies; it is known to contain epoxide 
functional groups along the basal plane of sheets as well as hydroxyl and 
carboxyl moieties along the edges ({link]). In contrast to other methods for 
the synthesis of GO, the the m-peroxybenzoic acid (m-CPBA) oxidation of 


microcrystalline synthetic graphite at room temperature yields graphite 
epoxide in high yield, without significant additional defects. 


Idealized structure proposed for graphene 
oxide (GO). Adapted from C. E. Hamilton, 
PhD Thesis, Rice University (2009). 


As graphite oxide is electrically insulating, it must be converted by 
chemical reduction to restore the electronic properties of graphene. 
Chemically converted graphene (CCG) is typically reduced by hydrazine or 
borohydride. The properties of CCG can never fully match those of 
graphene for two reasons: 


1. Oxidation to GO introduces defects. 
2. Chemical reduction does not fully restore the graphitic structure. 


As would be expected, CCG is prone to aggregation unless stabilized. 
Graphene materials produced from pristine graphite avoid harsh oxidation 
to GO and subsequent (incomplete) reduction; thus, materials produced are 
potentially much better suited to electronics applications. 


A catalytic approach to the removal of epoxides from fullerenes and 
SWNTs has been applied to graphene epoxide and GO. Treatment of 
oxidized graphenes with methyltrioxorhenium (MeReO3, MTO) in the 
presence of PPh; results in the oxygen transfer, to form O=PPh3 and allow 
for quantification of the C:O ratio. 


Homogeneous graphene dispersions 


An alternate approach to producing graphene materials involves the use of 
pristine graphite as starting material. The fundamental value of such an 
approach lies in its avoidance of oxidation to GO and subsequent 
(incomplete) reduction, thereby preserving the desirable electronic 
properties of graphene. There is precedent for exfoliation of pristine 
graphite in neat organic solvents without oxidation or surfactants. It has 
been reported that N,N-dimethylformamide (DMF) dispersions of graphene 
are possible, but no detailed characterization of the dispersions were 
reported. In contrast, Coleman and coworkers reported similar dispersions 
using N-methylpyrrolidone (NMP), resulting in individual sheets of 
graphene at a concentration of <0.01 mg/mL. NMP and DMF are highly 
polar solvents, and not ideal in cases where reaction chemistry requires a 
nonpolar medium. Further, they are hygroscopic, making their use 
problematic when water must be excluded from reaction mixtures. Finally, 
DMF is prone to thermal and chemical decomposition. 


Recently, dispersions of graphene has been reported in ortho- 
dichlorobenzene (ODCB) using a wide range of graphite sources. The 
choice of ODCB for graphite exfoliation was based on several criteria: 


1. ODCB is a common reaction solvent for fullerenes and is known to 
form stable SWNT dispersions. 

2. ODCB is a convenient high-boiling aromatic, and is compatible with a 
variety of reaction chemistries. 

3. ODCB, being aromatic, is able to interact with graphene via 1-1 
stacking. 

4. It has been suggested that good solvents for graphite exfoliation should 
have surface tension values of 40 — 50 mJ/m?. ODCB has a surface 
tension of 36.6 mJ/m?, close to the proposed range. 


Graphite is readily exfoliated in ODCB with homogenization and 
sonication. Three starting materials were successfully dispersed: 
microcrystalline synthetic, thermally expanded, and highly ordered 
pyrolytic graphite (HOPG). Dispersions of microcrystalline synthetic 
graphite have a concentration of 0.03 mg/mL, determined gravimetrically. 
Dispersions from expanded graphite and HOPG are less concentrated (0.02 
mg/mL). 


High resolution transmission electron microscopy (HRTEM) shows mostly 
few-layer graphene (n < 5) with single layers and small flakes stacked on 
top ({link]). Large graphitic domains are visible; this is further supported by 
selected area electron diffraction (SAED) and fast Fourier transform (FFT) 
in selected areas. Atomic force microscope (AFM) images of dispersions 
sprayed onto silicon substrates shows extremely thin flakes with nearly all 
below 10 nm. Average height is 7 - 10 nm. The thinnest are less than 1 nm, 
graphene monolayers. Lateral dimensions of nanosheets range from 100 — 
5900 nm. 


TEM images of single layer graphene from HOPG 
dispersion. (a) monolayer and few layer of graphene 
stacked with smaller flakes; (b) selected edge region 

from (a), (c) selected area from (b) with FFT inset, (d) 
HRTEM of boxed region in (c) showing lattice fringes 
with FFT inset. Adapted from C. E. Hamilton, PhD 
Thesis, Rice University (2009). 


As-deposited films cast from ODCB graphene show poor electrical 
conductivity, however, after vacuum annealing at 400 °C for 12 hours the 
films improve vastly, having sheet resistances on the order of 60 Q/sq. By 


comparison, graphene epitaxially grown on Ni has a reported sheet 
resistance of 280 Q2/sq. 


Covalent functionalization of graphene and graphite oxide 


The covalent functionalization of SWNTs is well established. Some routes 
to covalently functionalized SWNTs include esterification/ amidation, 
reductive alkylation (Billups reaction), and treatment with azomethine 
ylides (Prato reaction), diazonium salts, or nitrenes. Conversely, the 
chemical derivatization of graphene and GO is still relatively unexplored. 


Some methods previously demonstrated for SWNTs have been adapted to 
GO or graphene. GO carboxylic acid groups have been converted into acyl 
chlorides followed by amidation with long-chain amines. Additionally, the 
coupling of primary amines and amino acids via nucleophilic attack of GO 
epoxide groups has been reported. Yet another route coupled isocyanates to 
carboxylic acid groups of GO. Functionalization of partially reduced GO by 
aryldiazonium salts has also been demonstrated. The Billups reaction has 
been performed on the intercalation compound potassium graphite (Cgk), 
as well as graphite fluoride, and most recently GO. Graphene alkylation has 
been accomplished by treating graphite fluoride with alkyllithium reagents. 


ODCB dispersions of graphene may be readily converted to covalently 
functionalize graphene. Thermal decomposition of benzoyl peroxide is used 
to initiate radical addition of alkyl iodides to graphene in ODCB 
dispersions. 

Equation: 


R-I, benzoyl peroxide R RR R 


Additionally, functionalized graphene with nitrenes generated by thermal 
decomposition of aryl azides 
Equation: 


\ 
ArN3 N 


Ar Ar 
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Rolling Molecules on Surfaces Under STM Imaging 


Introduction to surface motions at the molecular level 


As single molecule imaging methods such as scanning tunneling 
microscope (STM), atomic force microscope (AFM), and transmission 
electron microscope (TEM) developed in the past decades, scientists have 
gained powerful tools to explore molecular structures and behaviors in 
previously unknown areas. Among these imaging methods, STM is 
probably the most suitable one to observe detail at molecular level. STM 
can operate in a wide range of conditions, provides very high resolution, 
and able to manipulate molecular motions with the tip. An interesting early 
example came from IBM in 1990, in which the STM was used to position 
individual atoms for the first time, spelling out "I-B-M" in Xenon atoms. 
This work revealed that observation and control of single atoms and 
molecular motions on surfaces were possible. 


The IBM work, and subsequent experiments, relied on the fact that STM tip 
always exerts a finite force toward an adsorbate atom that contains both van 
der Waals and electrostatic forces was utilized for manipulation purpose. By 
adjusting the position and the voltage of the tip, the interactions between the 
tip and the target molecule were changed. Therefore, applying/releasing 
force to a single atom and make it move was possible [link]. 


STM Tip 


Metal Substrate 


Manipulation of STM tip toward a xenon 
atom. a) STM tip move onto a target atom 
then change the voltage and current of the 
tip to apply a stronger interaction. b) Move 

the atom to a desire position. c) After 
reaching the desire position, the tip released 
by switching back to the scanning voltage 
and current. 


The actual positioning experiment was carried out in the following process. 
The nickel metal substrate was prepared by cycles of argon-ion sputtering, 
followed by annealing in a partial pressure of oxygen to remove surface 
carbon and other impurities. After the cleaning process, the sample was 
cooled to 4 K, and imaged with the STM to ensure the quality of surface. 


The nickel sample was then doped with xenon. An image of the doped 
sample was taken at constant-current scanning conditions. Each xenon atom 
appears as a located randomly 1.6 A high bump on the surface ([link]a). 
Under the imaging conditions (tip bias = 0.010 V with tunneling current 10° 
9 A) the interaction of the xenon with the tip is too weak to cause the 
position of the xenon atom to be perturbed. To move an atom, the STM tip 
was placed on top of the atom performing the procedure depicted in [link] 
to move it to its target. Repeating this process again and again led the 
researcher to build of the structure they desired [link]b and c. 


Manipulation of STM tip starting with a) randomly dosed xenon 
sample, b) under construction - move xenon atom to desire position, 
and c) accomplishment of the manipulation. Adapted from D. M. 
Eigler and E. K. Schweizer, Nature, 1990, 344, 524. 


All motions on surfaces at the single molecule level can be described as by 
the following (or combination of the following) modes: 


i. Sliding. 
li. Hopping. 
iii. Rolling. 
iv. Pivoting. 


Although the power of STM imaging has been demonstrated, imaging of 
molecules themselves is still often a difficult task. The successful imaging 
of the IBM work was attributed to selection of a heavy atom. Other 
synthetic organic molecules without heavy atoms are much more difficult to 
be imaged under STM. Determinations of the mechanism of molecular 


motion is another. Besides imaging methods themselves, other auxiliary 
methods such as DFT calculations and imaging of properly designed 
molecules are required to determine the mechanism by which a particular 
molecule moves across a surface. 


Herein, we are particularly interested in surface-rolling molecules, i.e., 
those that are designed to roll on a surface. It is straightforward to imagine 
that if we want to construct (and image) surface-rolling molecules, we must 
think of making highly symmetrical structures. In addition, the magnitudes 
of interactions between the molecules and the surfaces have to be adequate; 
otherwise the molecules will be more susceptible to slide/hop or stick on 
the surfaces, instead of rolling. As a result, only very few molecules are 
known can roll and be detected on surfaces. 


Surface rolling of molecules under the manipulation of STM 
tips 


As described above, rolling motions are most likely to be observed on 
molecules having high degree of symmetry and suitable interactions 
between themselves and the surface. C¢o is not only a highly symmetrical 
molecule but also readily imageable under STM due to its size. These 
properties together make Cgg and its derivatives highly suitable to study 
with regards to surface-rolling motion. 


The STM imaging of Cg was first carried out at At King College, London. 
Similar to the atom positioning experiment by IBM, STM tip manipulation 
was also utilized to achieve Cg, displacement. The tip trajectory suggested 
that a rolling motion took into account the displacement on the surface of 
Cg. In order to confirm the hypothesis, the researchers also employed ab 
initio density function (DFT) calculations with rolling model boundary 
condition ({link]). The calculation result has supported their experimental 
result. 


a) b) Cc) 
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Proposed mechanism of C¢p translation showing the alteration of 
Cgo°surface interactions during rolling. a) 2-point interaction. 
The left point interaction was dissociated during the interaction. 
b) 1-point interaction. Cgg can pivot on surface. c) 2-point 
interaction. A new interaction formed to complete part of the 
rolling motion. a) - c) The black spot on the Cgg is moved during 
the manipulation. The light blue Si balls represent the first layer 
of molecules the silicon surface, and the yellow balls are the 
second layer. 


The results provided insights into the dynamical response of covalently 
bound molecules to manipulation. The sequential breaking and reforming of 
highly directional covalent bonds resulted in a dynamical molecular 
response in which bond breaking, rotation, and translation are intimately 
coupled in a rolling motion ({link]), but not performing sliding or hopping 
motion. 


A triptycene wheeled dimeric molecule [link] was also synthesized for 
studying rolling motion under STM. This "tripod-like" triptycene wheel 
ulike a ball like Cgg molecule also demonstrated a rolling motion on the 
surface. The two triptycene units were connected via a dialkynyl axle, for 
both desired molecule orientation sitting on surface and directional 
preference of the rolling motion. STM controlling and imaging was 
demonstrated, including the mechanism [link]. 


Scheme of the rolling mechanism (left to right). 
Step 1 is the tip approach towards the molecule, 
step 2 is a 120 degree rotation of a wheel 
around its molecular axle and in step 3 the tip 
reaches the other side of the molecule. It shows 
that, in principle, only one rotation of a wheel 
can be induced (the direction of movement is 
marked by arrows). 


Single molecule nanocar under STM imaging 


Another use of STM imaging at single molecule imaging is the single 
molecule nanocar by the Tour group at Rice University. The concept of a 
nanocar initially employed the free rotation of a C-C single bond between a 
spherical Cgg molecule and an alkyne, [link]. Based on this concept, an 
“axle” can be designed into which are mounted Cgp “wheels” connected 
with a “chassis” to construct the “nanocar”. Nanocars with this design are 


expected to have a directional movement perpendicular to the axle. 
Unfortunately, the first generation nanocar (named “nanotruck” [link]) 
encountered some difficulties in STM imaging due to its chemical 
instability and insolubility. Therefore, a new of design of nanocar based on 
OPE has been synthesized [link]. 


Structure of Cg 
wheels connecting to 
an alkyne. The only 
possible rolling 
direction is 
perpendicular to the 
C-C single bond 
between Cgg and the 
alkyne. The arrow 
indicates the 
rotational motion of 
Ceo. 


Rotating axle Planar chassis Spherical wheel 


Structure of the nanotruck. No rolling motion was 
observed under STM imaging due to its instability, 
insolubility and inseparable unreacted Cgy. The double 
head arrow indicates the expected direction of nanocar 
movement. Y. Shirai, A. J. Osgood, Y. Zhao, Y. Yao, L. 
Saudan, H. Yang, Y.-H. Chiu, L. B. Alemany, T. Sasaki, 
J.-F. Morin, J. M. Guerrero, K. F. Kelly, and J. M. Tour, 
J. Am. Chem. Soc., 2006, 128, 4854. Copyright 
American Chemical Society (2006). 


Nanocar based on OPE structure. The size of the nanocar is 
3.3 nm X 2.1 nm (W x L). Alkoxy chains were attached to 
improve solubility and stability. OPE moiety is also separable 
from Cgg. The bold double head arrow indicates the expected 
direction of nanocar movement. The dimension of nanocar 
was 3.3 nm X 2.1 nm which enable direct observation of the 
orientation under STM imaging. Y. Shirai, A. J. Osgood, Y. 
Zhao, K. F. Kelly, and J. M. Tour, Nano Lett., 2005, 5, 2330. 
Copyright American Chemical Society (2005). 


The newly designed nanocar was studied with STM. When the nanocar was 
heated to ~200 °C, noticeable displacements of the nanocar were observed 
under selected images from a 10 min STM experiment [link]. The 
phenomenon that the nanocar moved only at high temperature was 
attributed their stability to a relatively strong adhesion force between the 


fullerene wheels and the underlying gold. The series of images showed both 
pivotal and translational motions on the surfaces. 


«H> 


Translational Motion 


Pivotal and translational 
movement of OPE based 


nanocar. Acquisition time of 
one image is approximately 1 
min with (a — e) images were 
selected from a series spanning 
10 min. The configuration of 
the nanocar on surface can be 
determined by the distances of 
four wheels. a) — b) indicated 
the nanocar had made a 80° 
pivotal motion. b) — e) 
indicated translation 
interrupted by small-angle 
pivot perturbations. Y. Shirai, 
A. J. Osgood, Y. Zhao, K. F. 
Kelly, and J. M. Tour, Nano 
Lett., 2005, 5, 2330. Copyright 
American Chemical Society 
(2005). 


Although literature studies suggested that the Cgg molecule rolls on the 
surface, in the nanocar movement studies it is still not possible to 
conclusively conclude that the nanocar moves on surface exclusively via a 
rolling mechanism. Hopping, sliding and other moving modes could also be 
responsible for the movement of the nanocar since the experiment was 
carried out at high temperature conditions, making the Cgg molecules more 
energetic to overcome interactions between surfaces. 


To tackle the question of the mode of translation, a trimeric “nano-tricycle” 
has been synthesized. If the movement of fullerene-wheeled nanocar was 
based on a hopping or sliding mechanism, the trimer should give observable 
translational motions like the four-wheeled nanocar, however, if rolling is 
the operable motion then the nano-tricycle should rotate on an axis, but not 
translate across the surface. The result of the imaging experiment of the 
trimer at ~200 °C ([{link],) yielded very small and insignificant translational 
displacements in comparison to 4-wheel nanocar ((link]). The trimeric 3- 


wheel nanocar showed some pivoting motions in the images. This motion 
type can be attributed to the directional preferences of the wheels mounted 
on the trimer causing the car to rotate. All the experimental results 
suggested that a Cgp-based nanocar moves via a rolling motion rather than 
hopping and sliding. In addition, the fact that the thermally driven nanocar 
only moves in high temperature also suggests that four Cgg have very strong 
interactions to the surface. 
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Cana 


Pivoting Motion 


Pivot motion of the 
trimer. a) - d) Pivot 
motions of circled 


trimered were shown in 
the series of images. No 
significant translation 
were observed in 
comparison to the 
nanocar. Y. Shirai, A. J. 
Osgood, Y. Zhao, K. F. 
Kelly, and J. M. Tour, 
Nano Lett., 2005, 5, 
2330. Copyright 
American Chemical 
Society (2005). 
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The Environmental Impact of the Manufacturing of Seminconductors 

This module gives a brief general overview of semi-conductor 
manufacturing and some of the components and processes used to produce 
them that can potentially cause harm to humans or the environment. 


Note:"This module was developed as part of a Rice University Class called 
"Nanotechnology: Content and Context" initially funded by the National 
Science Foundation under Grant No. EEC-0407237. It was conceived, 
researched, written and edited by students in the Fall 2005 version of the 
class, and reviewed by participating professors." 


What is a semiconductor? 


The semiconductor industry is one of the fastest growing manufacturing 
sectors in not only the United States but also in the world. According to the 
American Electronics Association, the domestic sales of electronic 
components have skyrocketed, jumping from $127 billion to $306 billion 
over the course of the 1980’s. In the first three quarters of the 2003 fiscal 
year alone, the export of technology goods from the United States increased 
by $19 billion [1]. 


The word “semiconductor” technically refers to any member of a class of 
solid, crystalline materials that is characterized by an electrical conductivity 
better than that of insulators (e.g., plastic) but less than that of good 
conductors (e.g., copper) [2]. Semiconductors are particularly useful as a 
base material in the manufacturing of computer chips, and the term 
semiconductor has actually come to be synonymous with the computer 
chips, themselves. However, semiconductors are not only used in 
computers. Computers only make up 44% of entire industry consumption 
(see [link]). Semiconductors are also used for military, automotive, 
industrial, communications, and other consumer purposes. 


Military 
Automotive 1% 
7% 


Industnial 
8% 


Computer 
44% 
Corsumer 
19% 


Communications 
21% 


Relative consumption of semiconductors by 
industry [3]. 


Semiconductors seem to be anywhere and everywhere throughout our 
everyday lives, yet it is surprising how little most people know about how 
they actually work or about the potentially devastating effects their 
manufacturing can have on the environment and human health. 


Why is nanotechnology important to the semiconductor 
industry? 


Much of the study of nanotechnology has been centered on the 
manufacturing of semiconductors. Though there are a number of highly 
anticipated applications for nanotechnology in other fields, notably in 
medicine and in biotechnology, the most tangible results thus far can be 
argued to have been achieved in the semiconductor industry. 


An example of a semiconductor 
(photo from PEAK). 


For example, Intel recently unveiled its first products based on a generation 
of 90-nanometer process technology, and its researches and engineers have 
built and tested prototype transistors all the way down to the 22-nanometer 
range. Currently, Intel scientists and engineers are working on identifying 
new materials such as carbon nanotubes and nanowires to replace current 
transistors, and in particular they hope to develop a “tri-gate” transistor 
approach that would enable chip designers to build transistors below the 22- 
nanometer range [4]. 


With the advent of nanotechnology, these transistors are becoming even 
faster and more powerful, and in accordance with the law of accelerating 
returns, the industry has been producing smaller transistors at lower costs 
with each and every passing year. As these semiconductors become smaller 
and smaller, they are quickly and surely pushing towards the limits of the 
nano-realm. 


These innovations, however, do not come without their fair share of 
challenges. Physical issues that are not problematic at the micron scale arise 
at the nano-scale due to the emergence of quantum effects, and in much the 


same way that optical microscopy cannot be utilized at the nano-scale, the 
semiconductor industry is fast approaching a similar diffraction limit. 
Optical lithography, for instance, a process that uses the properties of light 
to etch transistors onto wafers of silicon, will soon reach its limit. 


At its most basic level, nanotechnology involves pushing individual atoms 
together one by one. Since approximately 1.7 billion transistors are required 
for a single chip, this is obviously not a realistic method for mass 
production. Unless an alternative method for production or a solution to this 
problem is found, the development and manufacturing of transistors are 
expected to hit a proverbial brick wall by the year 2015. This is the reason 
that research in nanotechnology is so important for the world and future of 
semiconductors. 


How are semiconductors manufactured? 


Today’s semiconductors are usually composed of silicon, and they are 
manufactured in a procedure that combines the familiar with the bizarre; 
some steps that are involved in the process are as everyday as developing a 
roll of photographic film while others seem as if they would be better suited 
to take place on a spaceship. 


These semiconductors appear to the naked eye as being small and flat, but 
they are actually three-dimensional “sandwiches” that are ten to twenty 
layers thick. It can take more than two dozen steps and up to two full 
months to produce a single one of these silicon sandwiches. Some of the 
basic and more essential steps involved in the manufacturing process of 
silicon chips are briefly detailed below. 


First, silicon crystals are melted in a vat and purified to 99.9999% purity. 
The molten silicon is drawn into long, heavy, cylindrical ingots, which are 
then cut into thin slices called wafersabout the thickness of a business card. 


One side of each wafer must be polished absolutely smooth. This process is 
called chemical-mechanical polishing, and it involves bathing the wafers in 
special abrasive chemicals. After chemical-mechanical polishing, 


imperfections cannot be detected on the wafers even with the aid of a 
laboratory microscope. 


After a wafer is polished, layers of material must be stacked on top of the 
silicon wafer base. Insulating layers are laid down in alternation with 
conducting layers in a process called deposition. This is often achieved by 
spraying the chemicals directly onto the surface of the wafer through 
chemical vapor deposition. Following deposition, the wafer is coated with 
another layer of chemicals called a photoresist that is sensitive to light. 


Next, a machine called a stepper ({link]) is calibrated to project an 
extremely fine and focused image through a special type of reticle film in a 
manner similar to that of a simple slide projector. The light that is 
transmitted through the reticle is projected onto the photoresist layer, which 
reacts to the light and begins to harden. All of the parts of the wafer 
exposed to this light harden into a tough crust while the parts in shadow 
remain soft. This particular step is known by the name of 
photoelectrochemical etching because it achieves an etching effect, 
resulting in a chip. 


(ASM Lithography) 


An artist’s illustration of a stepper (image 


from Solid State Electronics). 


Hundreds of copies of the chip are etched onto the wafer until the entire 
surface has been exposed. Once this process is complete, the entire wafer is 
submerged into an etching bath, which washes away any parts of the 
photoresist that remain unexposed along with the insulating chemicals 
underneath. The hardened areas of the photoresist, however, remain and 
protect the layers of material underneath them. This process of depositing 
chemicals, coating with a photoresist, exposure to light over a film mask, 
and etching and washing away is repeated more than a dozen times. The 
result is an elaborate, three-dimensional construction of interlocking silicon 
wires. 


This product is then coated with another insulating layer and is plated with 
a thin layer of metal, usually either aluminum or copper. Yet another 
photoresist is laid down on top of this metal plating, and after the wafer is 
exposed in a stepper, the process repeats with another layer of metal. After 
this step has been repeated several more times, a final wash step is 
performed, and a finished semiconductor product rolls off the assembly 
line, at last. 


What is a clean room? 


A typical semiconductor fabrication facility, or “fab” in industry jargon, 
looks like a normal two- or three-story office building from the outside, and 
most of the interior space is devoted to one or more “clean rooms,” in 
which the semiconductors are actually made. A clean room is designed with 
a fanatical attention to detail aimed towards keeping the room immaculate 
and dust-free ({link]). 


An industry clean room at AP Tech (photo 
from Napa Gateway). 


Most if not all surfaces inside these clean rooms are composed of stainless 
steel, and these surfaces are sloped whenever possible or perforated by 
grating to avoid giving dust a place to settle. The air is filtered through both 
the ceiling and the floor to remove particles that are down to 1/100 the 
width of a human hair. Lighting is characteristically bright and slightly 
yellowish to prevent mildew from forming behind equipment or in recessed 
comers, and even the workers in a clean room must be absolutely spotless. 


Workers in these rooms must be covered from head to toe in “bunny suits” 
that completely seal the body in a bulky suit, helmet, battery pack, gloves, 
and boots. Once sealed in these suits, the workers often look more like 
space explorers in a science fiction movie than computer chip employees, 
but in order to even enter the stainless steel locker room to suit up to begin 
with, they must first pass through a series of air lock doors, stand under a 
number of “air showers” that actually blow dust off of clothing, and walk 
across a sticky floor matting that removes grime from the bottom of shoes. 


Semiconductor-manufacturing companies often portray their fabrication 
facilities as being clean, environmentally friendly, and conspicuously free 
of the black, billowing smokestacks that have come to be associated with 


the plants and factories of other major industries. These facilities produce 
no visible pollution and certainly do not appear to pose any health or 
environmental risks. 


In truth, the term “clean room,” itself is more than just a bit of an 
understatement. Industry executives often boast that their clean rooms are 
from 1,000 times to 10,000 times cleaner and more sanitary than any 
hospital operating room. 


What are the health risks involved in the semiconductor 
industry? 


The use of sterile techniques and the fastidious attention devoted to 
cleanliness in the semiconductor industry may perpetuate the illusion that 
the manufacturing of semiconductors is a safe and sterile process. However, 
as arapidly growing body of evidence continues to suggest, hardly anything 
could be further from the truth ({link]). The question of worker safety and 
chemical contamination at chip-making plants has received an increasing 
amount of attention over the course of the past decade. 


4. DANGER 


TOXIC CHEMICAL HAZARD 


WEAR RUBBER DONT BREATHE DON'T INGEST 
GLOVES VAPOR CHEMICAL 


EXPOSURE TO CHEMICAL CAN RESULT IN SERIOUS 
INJURY OR DEATH 


FARSHA 


Chemicals used in the 
manufacturing of semiconductors 
are known to have toxic effects 
(image from FARSHA). 


The devices being built at semiconductor fabrication facilities are super- 
sensitive to environmental contaminants. Because each chip takes dozens of 
trained personnel several weeks to complete, an enormous amount of time 
and effort is expended to produce a single wafer. The industry may pride 
itself on its perfectly immaculate laboratories and its bunny-suited workers, 
but it should be noted that the bunny suits are not designed to protect their 
wearers from hazardous materials but rather to protect the actual 
semiconductor products from coming into contact with dirt, hair, flakes of 
skin, and other contaminants that can be shed from human bodies. They 
protect the silicon wafers from the people, not the people from the 
chemicals. 


Lee Neal, the head of safety, health, and environmental affairs for the 
Semiconductor Industry Association, has been quoted as saying, “This is an 
environment that is cleaner than an operating room at a hospital.” However, 
this boast is currently being challenged by industry workers, government 
scientists, and occupational-health experts across the country and 
worldwide. 


Industrial hygiene has always been an issue in the semiconductor industry. 
Many of the chemicals involved in the manufacturing process of 
semiconductors are known human carcinogens or pose some other serious 
health risk if not contained properly. [link] lists ten of the hazardous 
chemicals most commonly used in manufacturing semiconductors along 
with their known effects on human health. 


Chemical name 


Acetone 


Arsenic 


Arsine 


Benzene 


Role in 
manufacturing 
process 


Chemical- 
mechanical polishing 
of silicon wafers 


Increases 
conductivity of 
semiconductor 
material 


Chemical vapor 
deposition 


Photoelectrochemical 


Health problems 
linked to exposure 


Nose, throat, lung, 
and eye irritation, 
damage to the skin, 
confusion, 
unconsciousness, 
possible coma 


Nausea, delirium, 
vomiting, 
dyspepsia, diarrhea, 
decrease in 
erythrocyte and 
leukocyte 
production, 
abnormal heart 
rhythm, blood 
vessel damage, 
extensive tissue 
damage to nerves, 
stomach, intestine, 
and skin, known 
human carcinogen 
for lung cancer 


Headache, malaise, 
weakness, vertigo, 
dyspnea, nausea, 
abdominal and 
back pain, jaundice, 
peripheral 
neuropathy, anemia 


Damage to bone 


etching 


Creates “holes” in 
silicon lattice to 


Cadmium 
create effect of 
positive charge 
Hydrochloric Photoelectrochemical 
acid etching 
Lead Electroplated 


soldering 


Marrow, anemia, 
excessive bleeding, 
immune system 
effects, increased 
chance of infection, 
reproductive 
effects, known 
human carcinogen 
for leukemia 


Damage to lungs, 
renal dysfunction, 
immediate hepatic 
injury, bone 
defects, 
hypertension, 
reproductive 
toxicity, 
teratogenicity, 
known human 
carcinogen for lung 
and prostate cancer 


Highly corrosive, 
severe eye and skin 
burns, 
conjunctivitis, 
dermatitis, 
respiratory 
irritation 


Damage to renal, 
reproductive, and 
immune systems, 
spontaneous 
abortion, premature 
birth, low birth 


Methyl 
chloroform Wasting 
Toluene Chemical vapor 


deposition 


Trichloroethylene Washing 


weight, learning 
deficits in children, 
anemia, memory 
effects, dementia, 
decreased reaction 
time, decreased 
mental ability 


Headache, central 
nervous system 
depression, poor 
equilibrium, eye, 
nose, throat, and 
skin irritation, 
cardiac arrhythmia 


Weakness, 
confusion, memory 
loss, nausea, 
permanent damage 
to brain, speech, 
vision, and hearing 
problems, loss of 
muscle control, 
poor balance, 
neurological 
problems and 
retardation of 
growth in children, 
suspected human 
carcinogen for lung 
and liver cancer 


Irritation of skin, 
eyes, and 
respiratory tract, 
dizziness, 


drowsiness, speech 
and hearing 
impairment, kidney 
disease, blood 
disorders, stroke, 
diabetes, suspected 
human carcinogen 
for renal cancer 


Chemicals of concern in the semiconductor industry [5]. 


Several semiconductor manufacturers including National Semiconductor 
and IBM have been cited in the past for holes in their safety procedures and 
have been ordered to tighten their handling of carcinogenic and toxic 
materials. 


In 1996, 117 former employees of IBM and the families of 11 workers who 
had died of cancer filed suit against the chemical manufacturers Eastman 
Kodak Company, Union Carbide Corporation, J. T. Baker, and KTI 
Chemicals, claiming that they had suffered adverse health effects as a result 
of exposure to hazardous chemicals on the job in the semiconductor 
industry [5]. The lawsuit was filed in New York, which prevented the 
employees from suing IBM directly. A separate group of former IBM 
workers who had developed cancer filed suit against the company in 
California, alleging that they had been exposed to unhealthy doses of 
carcinogenic chemicals over the past three decades. Witnesses who testified 
in depositions in the New York state court in Westchester County described 
how monitors that were supposed to warn workers of toxic leaks often did 
not function because of corrosion from acids and water. They also alleged 
that supervisors sometimes shut down monitors to maintain production 
rates. When they lodged complaints with senior officials in the company, 
they claim to have been told not to “make waves” [6]. Meanwhile, 70 
female workers in Scotland sued National Semiconductor Corporation, 
another U.S.-based company, claiming that they, too, were exposed to 
carcinogens on the job. 


These lawsuits and the resulting publicity prompted a groundbreaking study 
by the Health and Safety Executive, which commissioned a committee to 
investigate these allegations [7]. The committee found that there were 
indeed unusually high levels of breast and other kinds of cancer among 
workers at National Semiconductor’s fabrication facility in Greenock, 
Scotland. The committee concluded that the company had failed to ensure 
that the local exhaust ventilation systems adequately controlled the potential 
exposure of employees to hydrofluoric acid and sulphuric acid fumes and to 
arsenic dust. These findings proved to be extremely embarrassing for the 
company and for the industry. According to an official statement released 
by Ira Leighton, acting regional administrator of the New England branch 
of the U.S. Environmental Protection Agency, "National Semiconductor is a 
big business that uses a large amount of harmful chemicals and other 
materials. Our hazardous waste regulations were created to properly 
monitor dangerous chemicals and prevent spills. In order for it to work, it is 
important businesses to comply with all of the regulations. When 
companies fail to do this they are potentially putting people — their 
employees and neighbors — at risk [8]. " 


Moreover, a study of fifteen semiconductor manufacturers published in the 
December 1995 issue of the American Journal of Independent Medicine 
showed that women working in the so-called clean rooms of the 
semiconductor fabs suffered from a 14% miscarriage rate. 


Protesters at a rally staged against 
IBM (photo from San Francisco 
Independent Media Center). 


The main problem in prosecution is that the industry does not have a single 
overarching and definitive process for manufacturing, and it is difficult to 
pinpoint one particular compound as causing a certain health problem 
because some plants use as many as 300 chemicals. Also, many of the 
manufacturing processes take place in closed systems, so exposure to 
harmful substances is often difficult to detect unless monitored on a daily 
basis. 


Executives and spokespeople for the semiconductor industry maintain that 
any chip workers’ cancers and other medical problems are more likely due 
to factors unrelated to the job, such as family history, drinking, smoking, or 
eating habits. They also say that over the years, as awareness of chemical 
hazards has grown, they have made efforts to phase out toxic chemicals and 
to lower exposure to others. They insist that they use state-of-the-art 
process equipment and chemical transfer systems that limit or prevent 
physical exposure to chemicals and point out that the substances used in the 
semiconductor industry are used in other industries without a major health 
or safety problem. 


What environmental risks are involved? 


In theory, attention to cleanliness is in the manufacturer’s best interest not 
only from a health perspective but also from an economic. Many chemicals 
used in the production process are not expensive in and of themselves; 
however, the cost of maintaining these materials in an ultra-clean state can 
be quite high. This encourages the close monitoring of usage, the 
minimization of consumption, and the development of recycling and 
reprocessing techniques. Also, the rising costs of chemical disposal are 
prompting companies to conduct research into alternatives that use more 
environmentally friendly methods and materials. Individual companies and 


worldwide trade associations were active in reducing the use and emission 
of greenhouse gases during the 1990’s, and the industry as a whole has 
substantially reduced emissions over the last twenty years. 


Nonetheless, there has been a history of environmental problems linked to 
the industry in Silicon Valley and other technology centers. To begin with, a 
tremendous amount of raw materials is invested in the manufacturing of 
semiconductors every year. 


Moreover, a typical facility producing semiconductors on six-inch wafers 
reportedly uses not only 240,000 kilowatt hours of electricity but also over 
2 million gallons of water every day [9]. Newer facilities that produce 
eight-inch and twelve-inch wafers consume even more, with some estimates 
going as high as five million gallons of water daily. While recycling and 
reusing of water does occur, extensive chemical treatment is required for 
remediation, and in dry or desert areas such as Albuquerque, New Mexico, 
home to plants for Motorola, Philips Semiconductor, Allied Signal and 
Signetics, Intel, and other high-tech firms, the high consumption of water 
necessary for the manufacturing of semiconductors can pose an especially 
significant drain on an already scarce natural resource [10]. The existence 
of economic mainstays including the mining industry and the established 
presences of Sandia National Laboratories and the Los Alamos National 
Laboratory make New Mexico an attractive location for high-tech tenants. 
However, the opening of fabrication facilities in the state leaves its farmers 
and ranchers in constant competition with the corporations for rights to 
water consumption. On average, the manufacturing of just 1/8-inch of a 
silicon wafer requires about 3,787 gallons of wastewater, not to mention 27 
pounds of chemicals and 29 cubic feet of hazardous gases [11]. 


A community near Sutter Creek, 
California that has been designated as 
an EPA Superfund site as a result of 
arsenic contamination (photo from 
Alexander, Hawes, & Audet). 


Contamination has also been an issue in areas surrounding fabrication 
plants. Drinking water was found to be contaminated with trichloroethane 
and Freon, toxins commonly used in the semiconductor industry, in San 
Jose, California in 1981 [12]. These toxins were later suspected to be the 
cause of birth defects of many children in the area. The culprits were 
Fairchild Semiconductor and IBM. The companies’ underground storage 
tanks were found to have leaked tens of thousands of gallons of the toxic 
solvents into the ground. There are a number of semiconductor-related EPA 
cleanup sites in Silicon Valley, and there have been concerns raised about 
the cumulative air and groundwater pollution in Silicon Valley, as well. 


Another area of concern is the eventual fate of discarded electronic systems 
such as computers, pagers, mobile phones, and televisions that contain 
semiconductor devices. Personal computers in particular are especially 
problematic because they become obsolete fairly rapidly and lose almost all 
of their market value within five or ten years after their date of 
manufacture. Tens of millions of PC’s are sold in the United States each 
year, and they pose an environmental risk not only through their sheer bulk 


in city dumps and landfills but also because their semiconducting devices 
often contain significant amounts of heavy metals, including lead and other 
potentially hazardous substances. 


Why don’t we hear more about this on the news? 


Across the United States, approximately 60% of the manufacturing 
facilities for semiconductor devices are located in six states. These states 
listed in descending order are California, Texas, Massachusetts, New York, 
Illinois, and Pennsylvania. The industry appears to be concentrated in these 
particular locations in part because they are near the primary users, 
transportation routes, and experts in the field, but people of all ages in all 
fifty states are impacted by semiconductor technology. Consumerism of 
semiconductor products is only expected to increase in coming years. 
Apple, for instance, expects to have sold 23.6 million iPods, devices that 
rely on semiconductor technology, by the year 2006. 


If semiconductors are so ubiquitous in our day-to-day lives, why is there so 
little awareness about the serious environmental and health risks that are 
involved in their manufacturing process? Part of the problem is that little is 
known about the long-term health or environmental consequences of 
exposure to the chemicals that are used in the process. Because the 
semiconductor industry is still relatively new, not many studies have been 
conducted on this topic, and existing data is often inconclusive. This being 
said, some scientists predict that the cancer rate in the silicon chip industry 
will rise significantly in the future because cancer can take as long as 20-25 
years to manifest itself in populations of exposed workers. 


The EPA does have regulations in effect that are aimed toward the purpose 
of controlling the levels of contaminants released and minimizing human 
and environmental exposure to them. However, current regulations do not 
mandate that American companies report on offshore manufacturing. 
Therefore, even as media coverage and general awareness increase, 
companies can simply outsource more and more of their fabrication 
facilities to, for example, Southeast Asia. Some companies, in fact, have 
begun to do so, and there have even already been studies conducted on the 


health issues of workers in the electronics and semiconductor industries of 
Singapore and Malaysia [13]. 


Thus, changes in how and where semiconductor firms manufacture chips 
currently outstrip the present ability of the United States government and 
media institutions to track and monitor their potential threats to humans and 
the environment. If this situation is to change for the better in the near 
future, it is clear that radical reforms will need to take place on a number of 
different levels. However, the who, what, when, where, and why, so to 
speak, of that reform remains to be addressed. 


Discussion questions 


How many electronics products do you use on a day-to-day basis? 
How many of these products contain semiconductors? 

Who do you think is ultimately responsible for initiating reform? The 
government? The corporation? The consumer? 

Do you think that the health and environmental incidents related to 
semiconductor manufacturing will remain isolated incidents? Or do 
you think that these incidents will become epidemic in the future? 
Do you think that nanotechnology will help the problem or make the 
problem worse? 
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